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MAMMALIAN SIMP PROTEIN, GENE SEQUENCE 
AND USES THEREOF IN CANCER THERAPY 



BACKGROUND OF THE INVENTION 



10 



a) 



Field of the invention 



The present invention is concerned with a protein called "SIMP" that is a 
Source of immunodominant MHC-associated Peptides and more particularly to the 
use of SIMP nucleic acids, proteins, fragments, antibodies, probes, and cells, to 
characterize SIMP, modulate its cellular levels, diagnose and treat cancers and 
modulate an immune response. 



b) Brief description of the prior art 

Adoptive immunotherapy is a main approach that is currently being 

15 investigated in the field of cancer immunotherapy. Adoptive immunotherapy 
involves injection of lymphocytes (or of lymphocyte receptor(s) transfected into 
another cell type) from one individual to an other. According to this approach, 
patients with cancer are treated by allogeneic hematopoietic cell transplant 
(AHCT) from a cancer-free donor. Following AHCT, eradication of cancer cells is 

20 primarily mediated by a donor T-cell dependent immune reaction commonly 
referred to as the graft-versus-tumor (GVT) effect. 

Recently, one of the present inventors has shown that it is possible to 
transfer T-cells from a donor to a compatible recipient without causing to the latter 
a graft-versus-host disease (GVHD) reaction (International PCT application 

25 PCT/CA01/01477; and Fontaine et al. } (2001). Nat.Med. 7:789-794). These 
experiments, which were carried out in mice, were based on the priming of T-cells 
specifically reacting against B6 dom1 , a selected immunodominant ubiquitous MiHA. 
Although the immunogenic properties of B6 dom1 have been characterised (Eden et 
a/., (1999) J. Immunol. 162:4502-4510), the identity of the gene/protein from which 

30 B6 dom1 was derived and whether a human homolog existed was unknown until 
now. 
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Given that B6 dom1 peptide(s) seemed to represent an ideal target for 
adoptive cancer immunotherapy, there is thus a need to identify the human 
homolog of B6 dom1 . 

There is also a need for a human protein and a nucleic acid encoding the 
same, that is expressed ubiquitously in human cells and which has the potential of 
generating a plurality of protein fragments binding with high affinity to human MHC 
molecules, and more particularly human HLA molecules. 

The present invention fulfils this need and also other needs as it will be 
apparent to those skilled in the art upon reading the following specification. 



SUMMARY OF THE INVENTION 

The present inventors have discovered a protein called "SIMP" (Source of 
immunodominant MHC-associated Peptides) which is a human homolog of the 
mouse gene encoding B6 dom1 . The present inventors have also discovered uses 
15 for human SIMP proteins, fragments, nucleic acids, and antibodies for modulating 
its cellular levels, for diagnosing and treating cancers, and for modulating immune 
response 

In general, the invention features an isolated or purified nucleic acid 
molecule, such as genomic, cDNA, antisense DNA, RNA or a synthetic nucleic 
20 acid molecule that encodes or corresponds to a human SIMP polypeptide. 

According to a first aspect, the invention features isolated or purified nucleic 
acid molecules, polynucleotides, polypeptides, human proteins and fragment 
thereof. 

In a first embodiment, the isolated or purified nucleic acid molecule encodes 
25 a human protein that is expressed ubiquitously in human cells, the protein having 
the potential of generating a plurality of protein fragments binding with high affinity 
to a human HLA molecule. Preferably, the HLA molecule is selected from the HLA 
molecules listed in Table 1 . Preferably, the protein fragments are selected from the 
peptides listed in Table 1 as well. 
30 In another embodiment, the invention provides an isolated or purified 

human protein that is expressed ubiquitously in human cells, the protein having the 
potential of generating a plurality of protein fragments that bind with high affinity to 
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a human HLA molecule. In further embodiments, there is provided polypeptides 
comprising a definite amino acid sequence. 

In preferred embodiments of the invention, the human protein is 
overexpressed in proliferative cells, such as tumoral cells, and expression of the 
5 protein is essential for the tumoral cell's survival. More preferably, the human 
protein is a functional or structural homolog of yeast STT3 (SEQ ID NO: 6) and/or 
a paralog of human ITM1 (SEQ ID NO: 12). 

According to a specific embodiment, the nucleic acid of the invention 
comprises a polynucleotide having a nucleotide sequence coding an amino acid 
10 sequence selected from the group consisting of: 

a) an amino acid sequence having greater than 71% amino acid sequence 
identity to SEQ ID NO:8; 

b) an amino acid sequence having greater than 71% amino acid sequence 
identity to an amino acid sequence encoded by an open reading frame having 

15 SEQ ID NO:7; 

c) an amino acid sequence having greater than 82% amino acid sequence 
homology to SEQ ID NO: 8; 

d) an amino acid sequence having greater than 82% amino acid sequence 
homology to an amino acid sequence encoded by an open reading frame 

20 having SEQ ID NO: 7; 

e) an amino acid sequence having greater than 97% amino acid sequence 
identity to SEQ ID NO: 2; 

f) an amino acid sequence having greater than 97% amino acid sequence 
identity to an amino acid sequence encoded by an open reading frame having 

25 SEQ ID NO: 1; 

g) an amino acid sequence having greater than 97% amino acid sequence 
homology to SEQ ID NO: 2; and 

h) an amino acid sequence having greater than 97% amino acid sequence 
homology to an amino acid sequence encoded by an open reading frame 

30 having SEQ ID NO: 1. 

More preferably, the nucleic acid comprises a polynucleotide having a 
nucleotide sequence coding an amino acid sequence 100% identical to SEQ ID 
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NO: 2 and/or 100% identical to an amino acid sequence encoded by an open 
reading frame having SEQ ID NO: 1. 

According to another specific embodiment, the nucleic acid of the invention 
comprises a polynucleotide having a nucleotide sequence selected from the group 
5 consisting of: 

a) a nucleotide sequence having greater than 63% nucleotide sequence identity 
with SEQ ID NO:7; 

b) a nucleotide sequence having greater than 63% nucleotide sequence identity 
with a nucleic acid encoding an amino acid sequence of SEQ ID NO:8; 

10 c) a nucleotide sequence having at least 91% nucleotide sequence identity with 
SEQ ID NO: 1; and 

d) a nucleotide sequence having at least 91% nucleotide sequence identity with a 
nucleic acid encoding an amino acid sequence of SEQ ID NO: 2. 

More preferably, the nucleic acid comprises a polynucleotide 100% identical 
15 to SEQ ID NO: 1. 

According to another aspect, the invention features an isolated or purified 
nucleic acid molecule which comprises a polynucleotide having a definite 
nucleotide sequence selected from the group consisting of: 

a) a nucleotide sequence having greater than 63% nucleotide sequence identity 
20 with SEQ ID NO: 7; 

b) a nucleotide sequence having greater than 63% nucleotide sequence identity 
with a nucleic acid encoding an amino acid sequence of SEQ ID NO:8; 

c) a nucleotide sequence having at least 91% nucleotide sequence identity with 
SEQ ID NO: 1; 

25 d) a nucleotide sequence having at least 91% nucleotide sequence identity with a 
nucleic acid encoding an amino acid sequence of SEQ ID NO: 2; and 

e) a nucleotide sequence complementary to any of the nucleotide sequences in 
(a), (b), (c) or (d). 

Preferably the nucleic acid molecule comprises a polynucleotide having a 
30 nucleotide sequence selected from the group consisting of: 

a) a nucleotide sequence having at least 91% nucleotide sequence identity with 
SEQ ID NO: 1; 
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b) a nucleotide sequence having at least 91% nucleotide sequence identity with a 
nucleic acid encoding an amino acid sequence of SEQ ID NO: 2; and 

c) a nucleotide sequence complementary to any of the nucleotide sequences in 
(a) or (b). 

5 More preferably, the nucleic acid molecule comprises a polynucleotide 

having: 

a) a nucleotide sequence 100% identical to SEQ ID NO: 1; 

b) a nucleotide sequence complementary to SEQ ID NO: 1; and/or 

c) at least 15 nucleotides of the polynucleotide of (a) or (b). 

10 In a related aspect, the invention features an isolated or purified nucleic acid 

molecule which hybridizes under low, preferably high, stringency conditions to any 

of the nucleic acid molecules mentioned hereinabove. 

In a more specific aspect, the invention features an isolated or purified 

human nucleic acid molecule comprising a polynucleotide having the 
15 SEQ ID NO: 1, or degenerate variants thereof, and encoding a human SIMP 

polypeptide. Preferably, the nucleic acid is a cDNA and it encodes the amino acid 

sequence of SEQ ID NO: 2 or a fragment thereof. 

The invention also features substantially pure human polypeptides and 

proteins that are encoded by any of the above mentioned nucleic acids. In a 
20 preferred embodiment, the invention aims at an isolated or purified polypeptide 

comprising an amino acid sequence selected from the group consisting of: 

a) an amino acid sequence having greater than 71% amino acid sequence 
identity to SEQ ID NO: 8; 

b) an amino acid sequence having greater than 71% amino acid sequence 
25 identity to an amino acid sequence encoded by an open reading frame having 

SEQ ID NO: 7; 

c) an amino acid sequence having greater than 82% amino acid sequence 
homology to SEQ ID NO: 8; 

d) an amino acid sequence having greater than 82% amino acid sequence 
30 homology to an amino acid sequence encoded by an open reading frame 

having SEQ ID NO: 7; 
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e) an amino acid sequence having greater than 97% amino acid sequence 
identity to SEQ ID NO: 2; 

f) an amino acid sequence having greater than 97% amino acid sequence 
identity to an amino acid sequence encoded by an open reading frame having 

5 SEQ ID NO: 1; 

g) an amino acid sequence having greater than 97% amino acid sequence 
homology to SEQ ID NO: 2; and 

h) an amino acid sequence having greater than 97% amino acid sequence 
homology to an amino acid sequence encoded by an open reading frame 

10 having SEQ ID NO: 1 

More preferably, the polypeptide comprises an amino acid sequence 
selected from the group consisting of: 

a) an amino acid sequence 100% identical to SEQ ID NO: 2; 

b) an amino acid sequence 100% identical to an amino acid sequence encoded 
15 by an open reading frame having SEQ ID NO: 1; and 

c) an amino acid sequence consisting of at least eight consecutive amino acids of 
(a) or (b). 

In an even more specific aspect, the invention features a substantially pure 
human SIMP polypeptide, or a fragment thereof. Preferably, the SIMP polypeptide 
20 or fragment comprises an amino acid sequence having greater than 97% amino 
acid sequence homology, and more preferably 100%, with a polypeptide selected 
from the group consisting of: 

a) a polypeptide having SEQ ID NO: 2; 

b) a polypeptide having an amino acid sequence encoded by an open reading 
25 frame having SEQ ID NO: 1 ; and 

c) a polypeptide that is a fragment of (a) or (b). 

In a related aspect, the invention features an isolated or purified human 
protein that is a paralog of a human protein having SEQ ID NO: 12. Preferably the 
protein comprises an amino acid sequence having at least 25% identity or at least 
30 25% homology with SEQ ID NO: 12. Even more preferably, the percentages of 
identity and homology are of at least 50% and more specifically of about 56% and 
59% respectively. 
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The present invention also features protein fragments derived from any of 
the above mentioned protein or polypeptides. Accordingly, the present invention 
encompasses each of the polypeptides fragment listed in Table 1 and any 
fragment comprising at least eight sequential amino acids of SEQ ID NO:2 
5 (hSIMP) or of SEQ ID NO: 12 (hlTM1). Similarly, the invention further 
encompasses polypeptides fragment of comprising an amino acid sequence 
encoded by a nucleotide sequence comprising at least 24 sequential nucleic acid 
of SEQ ID NO:1 (hSIMP) or of SEQ ID NO:1 1 (hlTM1). 

The present invention further features an antisense nucleic acid and a 

10 pharmaceutical composition comprising the same. According to a first 
embodiment, the antisense hybridizes under high stringency condition to SEQ ID 
NO: 1 or to a complementary sequence thereof. According to another 
embodiment, the antisense hybridizes under high stringency conditions to a 
genomic sequence or to a mRNA so that it reduces human SIMP cellular levels of 

15 expression. Preferably, the antisense is complementary to a nucleic acid sequence 
encoding a protein having SEQ ID NO: 1 or encoding a fragment of this protein. 

In a related aspect, the present invention further features a method for 
modulating tumoral cell survival or for eliminating a tumoral cell in a mammal, the 
method comprising the step of reducing cellular expression levels of a SIMP 

20 polypeptide. Preferably, the method comprises the step of delivering a human 
SIMP antisense into the tumoral cell. 

Furthermore, the present invention features a method for eliminating 
tumoral cells in a mammal, preferably a human. The method comprises the step of 
injecting, into the mammal's circulatory system, T-lymphocytes that recognize a 

25 immune complex that is present at the surface of the tumoral cells, the immune 
complex consisting of a SIMP protein fragment or a ITM1 protein fragment bound 
to an MHC molecule. Preferably, the immune complex consists of a human SIMP 
protein fragment bound to a HLA molecule, the human SIMP protein fragment 
comprising at least eight sequential amino acids of SEQ ID NO: 2. Even more 

30 preferably, the hSIMP protein fragment is selected from the peptides listed in 
Table 1. 
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The present invention also features a method for increasing cell proliferation 
in a mammal, comprising the step of: i) contacting the cell with a SIMP 
polypeptide; and/or ii) increasing cellular expression levels of a SIMP polypeptide. 

The present invention further features a method for modulating an immune 
5 response in a mammal, preferably a human, comprising increasing the cellular 
expression levels of a SIMP polypeptide in the lymphoid cells of the mammals. In a 
preferred embodiment, the method is used for increasing the level and/or the 
duration of an antigen-primed lymphocyte proliferation. Preferably, the method 
comprises the transfection of lymphocytes with a cDNA coding for a SIMP 
10 polypeptide. 

The present invention features also a method for decreasing lymphoid cells 
proliferation, comprising decreasing in these cells cellular expression levels of a 
SIMP polypeptide. In a preferred embodiment, the method is used for suppressing 
an immune response responsible for an autoimmune disease or a transplant 
15 rejection. Preferably, the method comprises the delivery of a SIMP antisense into 
the lymphoid cells. 

According to another aspect, the invention features a nucleotide probe 
comprising a sequence of at least 15 sequential nucleotides of SEQ ID NO: 1 or of 
a sequence complementary to SEQ ID NO:1. The invention also encompasses a 

20 substantially pure nucleic acid that hybridizes under low, preferably high, 
stringency conditions to a probe of at least 40 nucleotides in length that is derived 
from SEQ ID NO:1. 

According to another aspect, the invention features a purified antibody. In a 
preferred embodiment, the antibody specifically binds to a purified mammalian 

25 SIMP polypeptide. Preferably, the antibody binds to a polypeptide having an amino 
acid sequence selected from the group consisting of SEQ ID NO: 2 and SEQ ID 
NO: 4. In another embodiment, the invention provides a monoclonal or polyclonal 
antibody which recognizes any of the human SIMP proteins, polypeptides, or 
fragments defined hereinabove. 

30 According to a further aspect, the invention features a method for 

determining the amount of a SIMP polypeptide in a biological sample, the method 
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comprising the step of contacting the sample with an antibody or with a probe as 
defined previously. 

In a related aspect, the invention features a method of diagnosis of a cancer 
in a human subject. The method comprises the step of determining the amount of 
5 a human SIMP polypeptide in a cell or a biological sample from a human subject, 
wherein the amount of SIMP is indicative of a probability for this subject to harbor 
proliferating tumoral cells. The method is particularly useful for detecting 
proliferating tumoral cells that grow rapidly and display a short doubling time. Such 
tumoral cells are commonly found in lung cancers, intestine cancers, sarcomas, 

10 prostate cancer, testis cancer, breast cancer, melanomas, pancreatic cancer 
prostate cancer and hematologic cancers. 

In another related aspect, the invention features a kit for determining the 
amount of a SIMP polypeptide in a sample, the kit comprising an antibody or a 
probe as defined previously, and at least one element selected from the group 

15 consisting of instructions for using the kit, reaction buffer(s), and enzyme(s). 

The nucleic acids of the invention may be incorporated into a vector and or 
a cell (such as a mammalian, yeast, nematode or bacterial cell). The nucleic acids 
may also be incorporated into a transgenic animal or embryo thereof. Therefore, 
the present invention features cloning or expression vectors, transformed or 

20 transfected cells and transgenic animals that contain any of the nucleic acids of 
the invention and more particularly those encoding a SIMP protein, polypeptide or 
fragment. 

In a related aspect, the invention features a method for producing a human 
SIMP polypeptide comprising: 
25 - providing a cell transformed with a nucleic acid sequence encoding a human 

SIMP polypeptide positioned for expression in this cell; 

- culturing the transformed cell under conditions suitable for expressing the 
nucleic acid; and 

- producing the hSIMP polypeptide. 

30 One of the greatest advantages of the present invention is that it provides 

nucleic acid molecules, proteins, polypeptides, antibodies, probes, and cells that 
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can be used for characterizing SIMP, modulate its cellular levels, diagnose and 
treat cancers and modulate an immune response. 

Other objects and advantages of the present invention will be apparent 
upon reading the following non-restrictive description of the preferred 
5 embodiments thereof and from the claims. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a graph showing the assessment of peptide recognition by 
10 C3H.SW anti-C57BL/6 cytotoxic T-lymphocytes (CTLs). 

DETAILED DESCRIPTION OF THE INVENTION 

A) Definitions 

Throughout the text, the word "kilobase" is generally abbreviated as "kb", 
15 the words "deoxyribonucleic acid" as "DNA", the words "ribonucleic acid" as "RNA", 
the words "complementary DNA" as "cDNA", the words "polymerase chain 
reaction" as "PCR", and the words "reverse transcription" as "RT". Nucleotide 
sequences are written in the 5* to 3' orientation unless stated otherwise. 

In order to provide an even clearer and more consistent understanding of 
20 the specification and the claims, including the scope given herein to such terms, 
the following definitions are provided: 

Antisense: as used herein in reference to nucleic acids, is meant a nucleic 
acid sequence, regardless of length, that is complementary to the coding strand of 
a gene. 

25 Expression: refers to the process by which gene encoded information is 

converted into the structures present and operating in the cell. In the case of 
cDNAs, cDNA fragments and genomic DNA fragments, the transcribed nucleic 
acid is subsequently translated into a peptide or a protein in order to carry out its 
function if any. The terms "overexpression" refer to an upward deviation 

30 respectively in assayed levels of expression as compared to a baseline expression 
level which is the level of expression that is found under normal conditions and 
normal level of functioning (e.g. non tumoral cells). By "positioned for 
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expression" is meant that the DNA molecule is positioned adjacent to a DNA 
sequence which directs transcription and translation of the sequence (i.e., 
facilitates the production of, e.g., a NAIP polypeptide, a recombinant protein or a 
RNA molecule). 

5 Fragment: Refers to a section of a molecule, such as a protein, a 

polypeptide or a nucleic acid, and is meant to refer to any portion of the amino acid 
or nucleotide sequence. 

Homolog: refers to a nucleic acid molecule or polypeptide that shares 
similarities in DNA or protein sequences. 

10 Host: A cell, tissue, organ or organism capable of providing cellular 

components for allowing the expression of an exogenous nucleic acid embedded 
into a vector or a viral genome, and for allowing the production of viral particles 
encoded by such vector or viral genome. This term is intended to also include 
hosts which have been modified in order to accomplish these functions. Bacteria, 

15 fungi, animal (cells, tissues, or organisms) and plant (cells, tissues, or organisms) 
are examples of a host. 

Isolated or Purified or Substantially pure: Means altered "by the hand of 
man" from its natural state, i.e., if it occurs in nature, it has been changed or 
removed from its original environment, or both. For example, a polynucleotide or a 

20 protein/peptide naturally present in a living organism is not "isolated", the same 
polynucleotide separated from the coexisting materials of its natural state, 
obtained by cloning, amplification and/or chemical synthesis is "isolated" as the 
term is employed herein. Moreover, a polynucleotide or a protein/peptide that is 
introduced into an organism by transformation, genetic manipulation or by any 

25 other recombinant method is "isolated" even if it is still present in said organism. 

Nucleic acid: Any DNA, RNA sequence or molecule having one nucleotide 
or more, including nucleotide sequences encoding a complete gene. The term is 
intended to encompass all nucleic acids whether occurring naturally or non- 
naturally in a particular cell, tissue or organism. This includes DNA and fragments 

30 thereof, RNA and fragments thereof, cDNAs and fragments thereof, expressed 
sequence tags, artificial sequences including randomized artificial sequences. 
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Open reading frame ("ORF"): The portion of a cDNA that is translated into 
a protein. Typically, an open reading frame starts with an initiator ATG codon and 
ends with a termination codon (TAA, TAG or TGA). 

Paralog: As used herein, refers to a protein or a polypeptide that is 
5 encoded by a gene locus that has arisen through evolution by gene duplication in 
one species. 

Polypeptide: means any chain of more than two amino acids, regardless of 
post-translational modification such as glycosylation or phosphorylation. 

SIMP nucleic acid: means any nucleic acid (see above) encoding a 

10 mammalian polypeptide that has the potential of generating a plurality of protein 
fragments binding with high affinity to MHC molecules, and having at least 90%, 
preferably at least 95% and most preferably 100% identity or homology to the 
amino acid sequence shown in SEQ. ID. NO: 2 (human) or 4 (mouse). When 
referring to a human SIMP nucleic acid, the nucleic acid encoding SEQ. ID. NO: 2 

15 is more particularly concerned. SIMP protein or SIMP polypeptide: means a 
polypeptide, or fragment thereof, encoded by a SIMP nucleic acid as described 
above. 

Specifically binds: means an antibody that recognizes and binds a protein 
but that does not substantially recognize and bind other molecules in a sample, 

20 e.g., a biological sample, that naturally includes protein. 

Substantially identical: means a polypeptide or nucleic acid exhibiting at 
least 50%, preferably 85%, more preferably 90%, and most preferably 95% 
homology to a reference amino acid or nucleic acid sequence. For polypeptides, 
the length of comparison sequences will generally be at least 16 amino acids, 

25 preferably at least 20 amino acids, more preferably at least 25 amino acids, and 
most preferably 35 amino acids. For nucleic acids, the length of comparison 
sequences will generally be at least 50 nucleotides, preferably at least 60 
nucleotides, more preferably at least 75 nucleotides, and most preferably 110 
nucleotides. Sequence identity is typically measured using sequence analysis 

30 software with the default parameters specified therein (e.g., Sequence Analysis 
Software Package of the Genetics Computer Group, University of Wisconsin 
Biotechnology Center, 1710 University Avenue, Madison, Owl 53705). This 
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software program matches similar sequences by assigning degrees of homology 
to various substitutions, deletions, and other modifications. Conservative 
substitutions typically include substitutions within the following groups: glycine, 
alanine, valine, isoleucine, leucine; aspartic acid, glutamic acid, asparagine, 
5 glutamine; serine, threonine; lysine, arginine; and phenylalanine, tyrosine. More 
particularly, "substantially pure polypeptide" means a polypeptide that has been 
separated from the components that naturally accompany it. Typically, the 
polypeptide is substantially pure when it is at least 60%, by weight, free from the 
proteins and naturally-occurring organic molecules with which it is naturally 

10 associated. Preferably, the polypeptide is a SIMP polypeptide that is at least 75%, 
more preferably at least 90%, and most preferably at least 99%, by weight, pure. A 
substantially pure SIMP polypeptide may be obtained, for example, by extraction 
from a natural source (e.g. a fibroblast, neuronal cell, or lymphocyte) by 
expression of a recombinant nucleic acid encoding a NAIP polypeptide, or by 

15 chemically synthesizing the protein. Purity can be measured by any appropriate 
method, e.g., by column chromatography, polyacrylamide gel electrophoresis, or 
HPLC analysis. A protein is substantially free of naturally associated components 
when it is separated from those contaminants which accompany it in its natural 
state. Thus, a protein which is chemically synthesized or produced in a cellular 

20 system different from the cell from which it naturally originates will be substantially 
free from its naturally associated components. Accordingly, substantially pure 
polypeptides include those derived from eukaryotic organisms but synthesized in 
E. coli or other prokaryotes. By "substantially pure DNA" is meant DNA that is 
free of the genes which, in the naturally-occurring genome of the organism from 

25 which the DNA of the invention is derived, flank the gene. The term therefore 
includes, for example, a recombinant DNA which is incorporated into a vector; into 
an autonomously replicating plasmid or virus; or into the genomic DNA of a 
prokaryote or eukaryote; or which exists as a separate molecule (e.g., a cDNA or a 
genomic or cDNA fragment produced by PCR or restriction endonuclease 

30 digestion) independent of other sequences. It also includes a recombinant DNA 
which is part of a hybrid gene encoding an additional polypeptide sequence. 
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Transformed or Transfected or Transgenic cell: refers to a cell into which 
(or into an ancestor of which) has been introduced, by means of recombinant DNA 
techniques, a DNA molecule encoding (as used herein) a SIMP polypeptide. By 
"transformation" is meant any method for introducing foreign molecules into a cell. 
5 Lipofection, calcium phosphate precipitation, retroviral delivery, electroporation, 
and ballistic transformation are just a few of the teachings which may be used. 

Transgenic animal: any animal having a cell which includes a DNA 
sequence which has been inserted by artifice into the cell and becomes part of the 
genome of the animal which develops from that cell. As used herein, the 
10 transgenic animals are usually mammalian (e.g., rodents such as rats or mice) and 
the DNA (transgene) is inserted by artifice into the nuclear genome. 

Ubiquitously expressed: refers to a polypeptide that is present, under 
normal conditions, in every single cell of an organism. 

Vector: A self-replicating RNA or DNA molecule which can be used to 
15 transfer an RNA or DNA segment from one organism to another. Vectors are 
particularly useful for manipulating genetic constructs and different vectors may 
have properties particularly appropriate to express protein(s) in a recipient during 
cloning procedures and may comprise different selectable markers. Bacterial 
plasm ids are commonly used vectors. 

20 

B) General overview of the invention 

The present inventors have discovered a protein called "SIMP" (Source of 
immunodominant MHC-associated Peptides). In human , this protein is the 
homolog of the mouse gene encoding B6 dom1 (referred herein as mouse SIMP). 
25 The human SIMP is also a paralog of human ITM1. The present inventors have 
also discovered uses for human SIMP proteins, fragments, nucleic acids, and 
antibodies for modulating its cellular levels and for diagnosing and treating 
cancers. Each of the aspects of the invention will be described in details 
hereinafter. 

30 
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/) Cloning and molecular characterization of SIMP 

As it will be described hereinafter in the exemplification section of the 
invention, the inventors have discovered, cloned and sequenced a human cDNA 
encoding a new human protein called human SIMP. This procedure was carried 
5 out starting with the amino acid sequence of a mouse minor histocompatibility 
antigen (MiHA) called "B6 dom1, \ 

The sequence of the SIMP cDNA and predicted amino acid sequence is 
shown in the "Sequence Listing" section. SEQ ID NO: 1 corresponds to the human 
SIMP cDNA and SEQ ID NO: 2 corresponds to the predicted amino acid sequence 

10 of the human protein. 

The hSIMP gene encodes a protein of 826 amino acids long. In silico 
analysis indicates that human SIMP protein has the following features: it has a 
molecular weight of about 93 674 g/mol, an isoelectric point of about 9.0; an 
instability index of about 41 (i.e. unstable); an aliphatic index of about 88; and a 

15 grand average of hydropathicity (GRAVY) of about 0.038. It further comprises 
many potential phosphorylation sites (26 Ser, 9 Thr, and 9 Tyr); and also many 
potential N-glycosylation and myristoylation sites. It also possesses more than 10 
potential transmembrane domains. 

As shown herein below, hSIMP protein contains an amino acid sequence 

20 having the potential of generating numerous peptides or peptide fragments 
possessing a high binding affinity motif for HLA class I molecules. This is very 
interesting since some but not all proteins generate peptides that are presented by 
MHC molecules. The most important factor determining whether a given peptide 
sequence will be presented by MHC molecules is its affinity for MHC molecules 

25 expressed by the cell in which it is expressed. Thus, a peptide with a low affinity 
for relevant MHC molecules will not form significant amounts of MHC/peptide 
complexes at the cell surface. On the contrary, the probability that a peptide with a 
high affinity for relevant MHC molecules will form significant levels of MHC/peptide 
complexes is about 68%. This is largely due to the fact that MHC class I molecules 

30 serve as templates for guiding ER aminopeptidases to generate the optimal MHC 
class I binding epitopes. In this way, the antigen-processing pathway efficiently 
generates peptides that fit exactly within the antigen binding grooves of the MHC 
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class I molecules. Peptide sequences in a given protein that have a high affinity for 
a specific HLA molecule can be predicted with the BIMAS™ algorithm. The validity 
of predictions based on this program has been confirmed in about fifty studies. 

Strikingly, many hSIMP peptides sequences possess a high affinity binding 
5 motif for HLA class I molecules. Those with the highest affinity are listed in 
Table 1. Methods of use of these peptides are described in the following sections. 



Table 1. Human SIMP-derived peptides with a high affinity binding motif for 
HLA molecules 

10 
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£^9 nft7 






ARQ 


Rl Ml Tl TP\/ 

r\L_IVIL_ 1 L 1 rv 


1ft 
I o 


c;Q1 ooq 






71 


1 I <^FTII Fl 
LLor i iL.ru 


1Q 


ARQ ^Qft 






RAL 


Ml Ml 1 MMFA 

IVI L IVI l_ LI VI IVI \r\ 




^q«; 9Q« 






971 


Ml IPI HY/FV 
INLIrLn Vr V 


91 


^R9 S^fi 






ft1 


Wl AHF^^RI 

V V L^TAVJ? 1 OOr\L 


99 


415 






9^0 


1 OFTYYI WV 

LVki 1 I I l_V V V 










9^S 


Yl WX/K^X/KT 
i i_ v v v rxo v r\ i 


94 


984 517 








FOTI FFI G\/ 




9^4 904 






435 


NINDERVFV 


26 


215.655 






291 


YIAYSTFYI 


27 


210.500 






428 


GLWFCIKNI 


28 


199.162 






172 


FLAPTFSGL 


29 


186.707 






460 


LMLTLTPW 


30 


129.543 






546 


LLMMFAVHC 


31 


118.745 






509 


NLYDKAGKV 


32 


118.628 






156 


ILNTLNITV 


33 


118.238 






358 


SLAAGAVFL 


34 


117.493 






179 


GLTSISTFL 


35 


117.493 






347 


QEFQTLFFL 


36 


112.763 






228 


FALQFTYYL 


37 


105.542 




10 


543 


MLMLLMMFAV 


38 


5836.011 






548 


MMFAVHCTWV 


39 


1737.776 






70 


SLLSFTILFL 


40 


999.867 






302 


LILSMQIPFV 


41 


760.945 






229 


ALQFTYYLWV 


42 


573.804 






386 


SLWDTGYAKI 


43 


532.542 






281 


LLMQRYSKRV 


44 


437.482 






365 


FLSVIYLTYT 


45 


433.632 






199 


LLAACFIAIV 


46 


423.695 






542 


LMLMLLMMFA 


47 


285.492 






470 


MLSAIAFSNV 


48 


224.653 






331 


LQAYAFLQYL 


49 


176.996 






258 


YMVSAWGGYV 


50 


165.213 
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155 


WILNTLNITV 


51 


162.769 






420 


ILVCTFPAGL 


52 


138.001 






179 


GLTSISTFLL 


53 


123.902 






545 


MLLMMFAVHC 


54 


A A Q ~T A C 

1 18.745 






271 


NLIPLHVFVL 


55 


116.840 






—J A 

71 


i i o i — r 1 1 r~i a 

LLSFTILFLA 


56 


A A O CtCA 

1 12.664 






546 


LLMMrAVHO 1 


C~7 

57 


-1 A7 QAQ 

1 .oUo 






yt C A 

459 


ni rv A 1 Tl TD\A/ 

KLML 1 L 1 rVV 


CO 

OO 


*i ac c«i n 
1 uo.oi U 






409 


T — TV A A /OCCCHI 

1 IWVorrrUL 


09 


a no *i o/i 


JL AAA 

A_0205 


a a 

10 


266 


\/\ >ni i km ini 

YVF1INLIPL 


A A 

60 


252.000 


A3 


9 


386 


SLWDTGYAK 


61 


A A A AAA 

300.000 














A24 


9 


561 


AYSSPSWL 


62 


200.000 






722 


YYRFGEMQL 


63 


200.000 






807 


GYIKNKLVF 


64 


150.000 






265 


GYVFIINLI 


65 


126.000 






694 


DYFTPQGEF 


66 


110.000 






445 


LYAISAVYF 


67 


100.000 






717 


MYKMSYYRF 


68 


100.000 




10 


451 


VYFAGVMVRL 


69 


280.000 






293 


AYSTFYIVGL 


70 


A*^ A^ A^ A*^ A*%_ A+^ 

200.000 






721 


SYYRFGEMQL 


71 


200.000 






375 


GYIAPWSGRF 


72 


150.000 






666 


GYSGDDINKF 


73 


132.000 


A68.1 


9 


642 


ETAAYKIMR 


74 


300.000 




10 


276 


HVFVLLLMQR 


75 


400.000 






450 


AWFAGVMVR 


76 


200.000 






786 


RVTN1FPKQK 


77 


120.000 






733 


RTPPGFDRTR 


78 


112.500 






158 


NTLNITVHIR 


79 


100.000 


B7 


9 


54 


APAGLSGGL 


a a 

80 


240.000 




10 


378 


APWSGRFYSL 


81 


240.000 






A f\ 

49 


APPKPAPAGL 


82 


A yl A. AAA 

240.000 


B8 


a a 

10 


747 


GNKDIKFKHL 


83 


A A A AAA 

120.000 




8 


8 


ESKH KSSL 


84 


160.000 


OA A 

B14 


9 


O O vl 

284 


nDVO!/D\/VI 

UKYoKKVYI 


oc 
OO 


1UU.UUU 




10 


439 


ERVFVALYAI 


86 


108.000 






284 


QRYSKRVYIA 


87 


100.000 


B_2702 


9 


284 


QRYSKRVYI 


88 


300.000 






599 


ARVMSWWDY 


89 


200.000 






87 


SRLFAVIRF 


90 


200.000 






135 


GRIVGGTVY 


91 


200.000 
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805 


KRGYIKNKL 


92 


180.000 






382 


GRFYSLWDT 


93 


100.000 




10 


93 


IRFESIIHEF 


94 


1000.000 






723 


YRFGEMQLDF 


95 


1000.000 






288 


KRVYIAYSTF 


96 


600.000 






340 


LRDRLTKQEF 


97 


200.000 






284 


QRYSKRVYIA 


98 


100.000 


B_2705 


9 


805 


KRGYIKNKL 


99 


6000.000 






284 


QRYSKRVYI 


100 


3000.000 






741 


TRNAEIGNK 


101 


2000.000 






584 


FREAYFWLR 


102 


1000.000 






87 


SRLFAVIRF 


103 


1000.000 






135 


GRIVGGTVY 


104 


1000.000 






732 


FRTPPGFDR 


105 


1000.000 






577 


TRNILDDFR 


106 


1000.000 






382 


GRFYSLWDT 


107 


1000.000 






599 


ARVMSWWDY 


108 


1000.000 






288 


KRVYIAYST 


109 


600.000 






803 


KRKRGYIKN 


110 


600.000 






649 


MRTLDVDYV 


111 


600.000 






592 


RQNTDEHAR 


112 


300.000 






346 


KQEFQTLFF 


113 


300.000 






230 


LQFTYYLWV 


114 


300.000 






189 


TRELWNQGA 


115 


200.000 






108 


YRSTHHLAS 


116 


200.000 






785 


PRVTNIFPK 


117 


200.000 






616 


NRTTLVDNN 


118 


200.000 






316 


IRTSEHMAA 


119 


200.000 






166 


IRDVCVFLA 


120 


200.000 






591 


LRQNTDEHA 


121 


200.000 






63 


SQPAGWQSL 


122 


200.000 






351 


TLFFLGVSL 


123 


150.000 






347 


QEFQTLFFL 


124 


150.000 






386 

\J v< 


SLWDTGYAK 


125 


150.000 






716 


LMYKMSYYR 


126 


125.000 






609 


YQIAGMANR 


127 


100.000 






406 


HQPTTWVSF 


128 


100.000 






93 


IRFESIIHE 


129 


100.000 






106 


FNYRSTHHL 


130 


100.000 






128 


ERAWYPLGR 


131 


100.000 






723 


YRFGEMQLD 


132 


100.000 






331 


LQAYAFLQY 


133 


100.000 




10 


504 


KRNQGNLYDK 


134 


6000.000 






723 


YRFGEMQLDF 


135 


5000.000 






93 


IRFESIIHEF 


136 


5000.000 






288 


KRVYIAYSTF 


137 


3000.000 






679 


VRIAEGEHPK 


138 


2000.000 






517 


VRKHATEQEK 


139 


2000.000 






649 


MRTLDVDYVL 


140 


2000.000 






803 


KRKRGYIKNK 


141 


1800.000 






337 


LQYLRDRLTK 


142 


1000.000 






284 


QRYSKRVYIA 


143 


1000.000 






591 


LRQNTDEHAR 


144 


1000.000 
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a0*fc f^am A«a. a. pa^ 

SEQ ID no: 


Score 






340 


LRDRLTKQEF 


145 


* aaa. a*ai a***. _«Bk -aa. 

1000.000 






230 


A a^awaaaaan * ■ a • »* j a ^ 

LQFTYYLWVK 


146 


1000.000 






346 


KQEFQTLFFL 


.at al aaa> 

147 


a***, a***. a**4 a*»* 

600.000 






458 


VRLMLTLTPV 


.at A a*X 

148 


600.000 






489 


KRENPPVEDS 


.a* A a**4 

149 


a***, a*N a*N AAA 

600.000 






805 


KRGYIKNKLV 


~J f*" a*N 

150 


540.000 






111 


a. a aaaa. aaBaaati aak a mm* P"""V f" - **. 

NRETLDHKPR 


151 


300.000 






213 


jBh bmbl ^aa, a. a m ^aak bbbbi bbb. bl a. 

SRSVAGSFDN 


152 


200.000 






68 


WQSLLSFTIL 


153 


a**a a""a a*%. >*4 a*a4 a*X. 

200.000 






108 


m m aah «aa>. aaa) ■ a a ■ a a\ ^■■a. I a 

YRSTHHLASH 


154 


a^^a, a*V ***a a**a a*% •'"a 

200.000 






331 


■ >lk aV a> at a\ aat a ^av. V ■ • 

LQAYAFLQYL 


155 


S—m ««a> jaw rfa» /a, 

200.000 


B_2705 


10 


616 


NRTTLVDNNT 


156 


200.000 






29 


SRHGHHGPGA 


157 


200.000 






316 


IRTSEHMAAA 


158 


200.000 






702 


FRVDKAGSPT 


159 


200.000 






732 


FRTPPGFDRT 


160 


a ja -a. A *«av ^a, 

200.000 






63 


SQPAGWQSLL 


161 


200.000 






592 


RQNTDEHARV 


J ya*> 

162 


^ ^*a a*X /*S 

180.000 






716 


LMYKMSYYRF 


163 


125.000 






406 


HQPTTWVSFF 


164 


100.000 






382 


GRFYSLWDTG 


165 


100.000 


B_3501 


10 


686 


HPKDIRESDY 


166 


240.000 


B_3701 


10 


704 


VDKAGSPTLL 


167 


200.000 


B_3801 


9 


573 


a. a a a aa, ja>_, . — ■ aK a, a ■ a 

NHDGTRNIL 


168 


180.000 


B_3901 


9 


573 


NHDGTRNIL 


168 


135.000 




10 


164 


VHIRDVCVFL 


169 


180.000 


B_4403 


9 


438 


DERVFVALY 


170 


j —a. ya, ^a. aa^ ^a. ^a. 

1080.000 






762 


SEHWLVRIY 


171 


aaa aa aa ja> ^ak 

720.000 






100 


a a a«aa» aaaa> aak avaa. a at M aaa a. a a. m 

HEFDPWFNY 


172 


180.000 






596 


aaw avam a a a\ aaaa. a. t a. ai a*aa. a at a 

DEHARVMSW 


173 


a -«av A«a. a*x ^av 

108.000 




10 


744 


at aaaaa a .^paa. av a a ^ apaav a A .a* fjaaa 

AEIGNKDIKF 


174 


^ a*% A"" a*"4 a*X a*"a #^ 

1350.000 






319 


SEHMAAAGVF 


M mmmm -aaa 

175 


180.000 


B_5101 


9 


308 


a aaaav aaaaaa. a ^aa. aaa .alPaa, am. a 

IPFVGFQPI 


176 


^ **** ^a^ j ^aa. 

1384.240 






425 


FPAGLWFCI 


177 


572.000 






261 


.aaav. a\ * at U — ' — ^a*. a, *v*. a aajaa a 

SAWGGYVFI 


178 


al J a*X a«*a a**a 

484.000 






90 


FAVIRFESI 


179 


^a. a a -ax /a 

314.600 






208 


m * j- L_ m ■ ^a. aaa, **a*. a, m 

VPGYISRSV 


180 


314.600 


* 




392 


YAKIHIPII 


181 


A a ^at ja>. -taw 

314.600 






743 


NAEIGNKDI 


182 


292.820 






292 


IAYSTFYIV 


183 


286.000 






18 


SPWSGLMAL 


184 


242.000 






560 


NAYSSPSW 


185 


220.000 






129 


RAWYPLGRI 


186 


220.000 






758 


EAFTSEHWL 


187 


220.000 






443 


VALYAISAV 


188 


157.300 






644 


AAYKIMRTL 


189 


146.410 






273 


IPLHVFVLL 


190 


143.000 






200 


LAACFIAIV 


191 


143.000 
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Sequence 
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64 


A A""V I A #—\ ■ | 

QPAGWQSLL 


192 


A an a AN an ^n 

121.000 






332 


QAYAFLQYL 


.A AN AN 

193 


A A A% y^ 

121.000 






300 


ft # J" - ^ ft 11 A*V ■ « ■ 

VGLILSMQI 


A V**t A 

194 


AAA A rf~V AN 

114.400 






54 


APAGLSGGL 


195 


A A AX AN /"\ AN 

110.000 






360 


AAGAVFLSV 


>i 

196 


110.000 




10 


A AN A» 

465 


TPWCMLSAI 


A AN 

197 


A AN A AN AN AN 

484.000 






174 


A 1™^ i^™ A^ A - ^ ■ A"\ 1 

APTFSGLTSI 


A AN AN 

198 


A /*N A /"N ^N ^N 

484.000 






261 


V A 1 A f A N A"^ V /\ / f | 4 

SAWGGYVFII 


199 


A A an AN /\ ^> 

440.000 






758 


ft^™ A ft^™^^^ ■ II A J 1 ft. # 

EAFTSEHWLV 


200 


A AN /\ 

400.000 






216 


\ # A A^N ft*^ A 1 ft^™ A^N ft 

VAGSFDNEGI 


^N _A 

201 


A A AN AN AN 

314.600 






681 


■ A A 1 ^ • ft ft"^ ft A* ftF*^ ft 

IAEGEHPKDI 


202 


314.600 


B_5101 


10 


90 


FAVIRFESII 


203 


286.000 






360 


AAGAVFLSV! 


204 


220.000 






196 


GAGLLAACFI 


205 


220.000 






264 


GGYVFIINLI 


206 


212.960 






529 


EGLGPNIKSI 


207 


212.960 






378 


APWSGRFYSL 


208 


200.000 






390 


TGYAKIHIPI 


209 


176.000 






359 


LAAGAVFLSV 


210 


157.300 






143 


YPGLMITAGL 


211 


143.000 






273 


IPLHVFVLLL 


212 


130.000 






49 


APPKPAPAGL 


213 


121.000 






6 


APESKHKSSL 


214 


110.000 






129 


RAWYPLGRIV 


215 


110.000 






449 


SAVYFAGVMV 


216 


110.000 






560 


NAYSSPSWL 


217 


100.000 


B_5102 


9 


308 


IPFVGFQPI 


218 


2420.000 






129 


RAWYPLGRI 


219 


2000.000 






90 


FAVIRFESI 


220 


1320.000 






261 


SAWGGYVFI 


221 


1210.000 






425 


FPAGLWFCI 


222 


880.000 






292 


IAYSTFYIV 


223 


550.000 






18 


SPWSGLMAL 


224 


550.000 






560 


NAYSSPSW 


225 


500.000 






228 


FALQFTYYL 


226 


399.300 






273 


IPLHVFVLL 


227 


363.000 






644 


AAYKIMRTL 


228 


332.750 






443 


ft J A ft V A A ■ *""V A ft A 

VALYAISAV 


229 


/\ AN AN AN ^N ^N 

330.000 






332 


^"*V A %v M A f™~ ft ^"^V V A ft 

QAYAFLQYL 


230 


302.500 






758 


f A ■ ™ 1 T * A"\ p 1 1 ft ft ft A ■ ft 

EAFTSEHWL 


231 


275.000 






197 


A ft ■ A A p ■ « 

AGLLAACFI 


232 


264.000 






806 


RGY1KNKLV 


233 


A^ A /\ AN AN AN 

242.000 






300 


ft f rf-^v ■ I ■ A A ft 

VGLILSMQI 


234 


AN A AN AN AN AN 

240.000 






392 


YAKIHIPII 


235 


220.000 






208 


VPGYISRSV 


236 


220.000 






743 


NAEIGNKDI 


237 


133.100 






64 


QPAGWQSLL 


238 


121.000 






314 


QPIRTSEHM 


239 


119.790 






200 


LAACFIAIV 


240 


110.000 






54 


APAGLSGGL 


241 


110.000 






.360 


AAGAVFLSV 


242 


110.000 






264 


GGYVFI1NL 


243 


110.000 




10 


90 


FAVIRFESII 


244 


1200.000 
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465 


TPWCMLSAI 


245 


1200.000 






261 


SAWGGYVFII 


246 


1100.000 






129 


RAWYPLGRIV 


247 


550.000 






758 


EAFTSEHWLV 


248 


550.000 






378 


APWSGRFYSL 


249 


500.000 






264 


GGYVFIINLI 


250 


440.000 






174 


APTFSGLTSI 


251 


440.000 






390 


TGYAKIHIPI 


252 


400.000 






529 


EGLGPNIKSI 


253 


351.384 






328 


FALLQAYAFL 


254 


330.000 






273 


IPLHVFVLLL 


255 


330.000 






449 


SAVYFAGVMV 


256 


300.000 


B_5201 


10 


427 


AGLWFCIKNI 


257 


290.400 






560 


NAYSSPSWL 


258 


250.000 






216 


VAGSFDNEGI 


259 


242.000 






143 


YPGLMITAGL 


260 


242.000 






196 


GAGLLAACFI 


261 


220.000 






360 


AAGAVFLSVI 


262 


200.000 






83 


AGFSSRLFAV 


263 


200.000 






362 


GAVFLSVIYL 


264 


165.000 






681 


IAEGEHPKDI 


265 


121.000 






359 


LAAGAVFLSV 


266 


121.000 






355 


LGVSLAAGAV 


267 


120.000 






453 


FAGVMVRLML 


268 


110.000 






49 


APPKPAPAGL 


269 


110.000 


B_5103 


9 


560 


NAYSSPSW 


270 


300.000 






292 


1AYSTFYIV 


271 


300.000 






443 


VALYAISAV 


272 


1 59 720 






261 


SAWGGYVFI 


273 


133.100 






806 


RGYIKNKLV 


274 


120.000 






90 


FAVIRFESI 


275 


110.000 






200 


LAACFIAIV 


276 


110.000 






360 


AAGAVFLSV 


277 


110.000 






743 


NAEIGNKDI 


278 


110.000 






392 


YAKIHIPII 


279 


110.000 






129 


RAWYPLGRI 


280 


100.000 




10 


264 


GGYVFIINLI 


281 


145.200 






758 


EAFTSEHWLV 


282 


132.000 






390 


TGYAKIHIPI 


283 


132.000 






449 


SAVYFAGVMV 


284 


121.000 






359 


LAAGAVFLSV 


285 


121.000 






196 


GAGLLAACFI 


286 


121.000 






216 


VAGSFDNEGI 


287 


110.000 






681 


IAEGEHPKDI . 


288 


110.000 






261 


SAWGGYVFII 


289 


110.000 






129 


RAWYPLGRIV 


290 


a f\f\ r\r\r\ 

100.000 






90 


FAVIRFESII 


291 


100.000 






360 


AAGAVFLSVI 


292 


100.000 


B_5201 


9 


531 


LGPNIKSIV 


293 


330.000 






292 


IAYSTFYIV 


294 


123.750 






130 


AWYPLGRIV 


295 


120.000 




10 


806 


RGYIKNKLVF 


296 


165.000 
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129 


RAWYPLGRIV 


297 


100.000 


B_5801 


9 


239 


KSVKTGSVF 


ftft 0 
298 


ft A ft ftft ft 

240.000 






A ft 

12 


O O 1 MOO r>\ A / 

KSSLNSSPW 


299 


ft A ft ft ft ft 

240.000 






380 


WSGRFYSLW 


300 


120.000 




a ft 

10 


239 


KSVKTGSVFW 


301 


A ft ft ft ft ft 

480.000 






617 


RTTLVDNNTW 


ftft 
302 


ft ftft A ft ft 

290.400 






72 


LSFTILFLAW 


303 


158.400 






ft ^ >■ 
254 


LSYFYMVSAW 


ft ft A 

304 


AAA ftftft 

144.000 


B60 


9 


347 


QEFQTLFFL 


305 


160.000 






AAA 

222 


k. i a^*% i a i a i 

NEGIAIFAL 


ft ft 

306 


160.000 




10 


757 


EEAFTSEHWL 


307 


320.000 






190 


RELWNQGAGL 


308 


320.000 






522 


TEQEKTEEGL 


309 


160.000 


B62 


9 


283 


MQRYSKRVY 


310 


132.000 






365 


FLSVIYLTY 


311 


105.600 



ii) SIMP homology of with other genes and proteins 

As mentioned previously, the cloning of hSIMP was carried out starting with 
5 the putative amino acid sequence of a mouse minor histocompatibility antigen 
(MiHA) called "B6 dom1, \ Prior to the present invention, the identity of the mouse 
gene encoding the B6 dom1 MiHA was unknown. A blast search revealed that 
human SIMP is highly homologous to a mouse gene (GENBANK™ accession No 
AK018758) for which no formal name nor biological role have been assigned. This 

10 mouse gene, referred hereinafter as mouse SIMP (mSIMP), contains an open 
reading frame of 2469 bp (SEQ. ID. NO: 3) and encodes a protein of some 823 
amino acids (SEQ. ID. NO: 4). 

Although not shown, the cDNA sequence of SEQ ID NO:150 of international 
PCT application WO 01/19988 (see GENBANK™ accession No AK027789) 

15 shares 100% identity with nucleic acids no 1510 to 2481 of hSIMP. The protein 
sequence of SEQ ID NO:151 of the same PCT application (see GENBANK™ 
accession No BAB55370) shares 100% identity with the C-terminal end of the 
human SIMP protein (amino acids no 541 to 826). SEQ ID NO:150 and 151 of 
WO 01/19988 correspond to an EST and a predicted protein for which no function 

20 is described. 
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Analysis of human and mouse SIMPs confirms that the two genes and 
proteins are highly homologous to each other. Indeed, the conservation between 
the hSIMP and mSIMP genes is striking. These are roughly 90% identical at the 
DNA level, while in terms of encoded amino acids the two proteins are 97% 
5 identical. This is strongly suggestive of the existence of a strong selection pressure 
to maintain the sequence and biological function of this protein across species. 
Since mSIMP is ubiquitously expressed in mice, it is expected that the same holds 
true for hSIMP. Applicants preliminary results (arrays) show that SIMP is fairly 
ubiquitous in human (not shown). However, sequencing of hSIMP cDNA in 
10 fourteen unrelated individuals (not shown) confirms that contrary to mSIMP, 
hSIMP is not polymorphic, i.e. hSIMP occurs in a single form in human. This 
means that probes and reagents that recognize or react with hSIMP from one 
individual should recognize or react in the same way with hSIMP from all human 
subjects. 

15 Blast searches were also made to identify sequence identity between 

hSIMP, mSIMP and other existing sequences. As shown hereafter in Table 2 
and Table 3, hSIMP and mSIMP were found to be highly homologous to yeast 
STT3 (GENBANK™ accession No D28952 (DNA; SEQ ID NO:5) and No 
BAA06079 (protein; SEQ ID NO:6); T12A2.2 C. Elegans (GENBANK™ accession 

20 No P46975 (protein; SEQ ID NO: 13); drosophila STT3 (GENBANK™ No 
AF1 32552 (DNA; SEQ ID NO:7 and protein; SEQ ID NO:8), mouse ITM1 
(GENBANK™ accession No NM_008408 (DNA; SEQ ID NO:9) and NP_032434 
(protein; SEQ ID NO:10)), and human ITM1 (GENBANK™ accession No 
NM_002219 (DNA; SEQ ID NO:11) and No NPJ302210 (protein; SEQ ID NO:12)). 

25 Standard techniques, such as the polymerase chain reaction (PCR) and 

DNA hybridization, may be used to clone additional SIMP homologues in other 
species. 
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Interestingly, the hSIMP gene encodes a protein of 826 amino acids which 
exhibits 53% identity and 69% similarity to yeast STT3, which establishes it as a 
novel member of this group of genes. Yeast STT3 is a subunit of a large complex 
required for the appropriate co-translational N-glycosylation of proteins, a 
5 modification that is characteristic of eukaryotes and is involved in chaperone- 
mediated protein folding. Disruption of this gene in yeast demonstrated that it is 
essential for cell growth, underscoring its likelihood to be critical for normal cellular 
function in higher eukaryotes. There appears to be a family of proteins directly 
related to STT3, with homologs found even in lower organisms such as 

10 archaebacteria, in addition to equivalents in higher organisms including mice and 
humans. That these proteins are remarkably well conserved across divergent 
species indicates a strong evolutionary pressure for maintenance of biological 
function of this family. 

The genes of mice and humans heretofore identified as being structurally 

15 and functionally related to STT3, is known as ITM1, for Integral Membrane 
Protein-1. The protein encoded by mouse ITM1 was found to contain many 
putative transmembrane domains and possesses roughly 52% identity and 66% 
similarity to yeast STT3, respectively. The T12A2.2 gene in C.elegans encodes a 
protein that is similarly conserved with both STT3 and ITM1, and represents 

20 another member of this family of proteins. In Drosophila melangoster there are 
homologs of both STT3 and ITM1 on different chromosomes, indicatory of the 
evolutionary separation of these genes. A human equivalent of ITM1 has also 
been cloned which has a similar degree of homology with STT3 as the mouse 
protein, but, interestingly, the proteins mice and humans are 97% identical, 

25 underlining the potentially major role of this protein in higher organisms. 

Human SIMP is in turn 59% identical and 73% similar to human ITM1, 
which, while significant, distinguishes it from its human homolog. Intriguingly, 
hSIMP protein is more similar to the C. elegans and D. melangoster STT3-like 
proteins (roughly 70% identity and 80% similarity) than it is to human ITM1. This 

30 would suggest that hSIMP evolved separately from ITM1, and that indeed hSIMP 
and ITM1 are functionally distinct. This is further emphasized by the degree of 
homology between human and mouse ITM1; these two proteins are roughly 98% 
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identical. Given the levels of identity between human SIMP and human ITM1, 
these two proteins presumably perform perhaps related but unique roles in 
humans. It is also proposed herein that the two genes are paralogs (i.e. 
homologous genes that diverged by gene duplication). Because hSIMP and hlTM1 
5 are paralogs, they may have similar roles, perhaps in different cell types. 
Accordingly, hSIMP may have a biological function similar to that of ITM1, and 
ITM1 an immunological function similar to that of hSIMP. For instance, we have 
verified using the BIMAS search tool, that similar to hSIMP, human ITM1 has the 
potential to generate protein fragments that bind with high affinity to HLA 
10 molecules (data not shown). The present invention therefore encompasses any 
use of such ITM1 -derived polypeptides, particularly in cancer immunotherapy. The 
invention also encompasses any sequences, probe, kit, method involving human 
ITM1 for similar uses as those mentioned throughout the present application for 
human SIMP. 

15 Given the high sequence homology of SIMP with STT3 and ITM1, it is 

reasonable to hypothesize that these proteins may have similar biological 
functions. Yeast STT3 and mouse ITM1 are known to be part of the 
oligosaccharyltransferase (OST) complex. N-linked protein glycosylation is an 
essential process in eukaryotic cells. In the central reaction, OST catalyzes the 

20 transfer of the oligosaccharide Glc 3 Man 9 GlcNac2 from dolicholpyrophosphate onto 
asparagine residues of nascent polypeptide chains in the lumen of the 
endoplasmic reticulum. A major function for sugars is to contribute to the stability 
of the proteins to which they are attached. Moreover, specific glycoforms are 
involved in recognition events. Like protein translocation, N-linked glycosylation 

25 clearly belongs to the functions that the ER has inherited from the prokaryotic, 
most likely archaeal, plasma membrane. STT3 and ITM1 proteins, transmembrane 
proteins with a C-terminal, lumenally oriented, hydrophilic domain, are part of the 
OST complex. Depletion of STT3 protein and mutation of STT3 result in loss of 
transferase activity in vivo, a deficiency in the assembly of the OST complex and 

30 loss of cell growth and viability which may be corrected by transfection with STT3 
or ITM1. Consistent with a role of STT3p homologs in cell proliferation, ITM1 
transcripts are expressed predominantly in tissues undergoing active proliferation 
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and differentiation. Tables 1 and 2 also shows a surprising degree of conservation 
of the STT3 protein between yeast and higher eukaryotes. 

Furthermore, OST activity seems to be particularly important for the cells of 
the immune system. This might not be surprising since almost all of the key 
5 molecules involved in the innate and adaptive immune response are glycoproteins. 
Specific glycoforms control crucial events in recognition of APCs by T-cells: 
assembly of MHC-peptide complexes, formation of immunological synapse, 
recognition of antigenic peptide-loaded MHC molecules by the TCRs and signal 
transduction. In previous studies OST activity was found to increase 10-fold after 

10 mitogen activation of PBLs. The number of copies of B6 dom1 MiHA per cell (a 
peptide from mSIMP) was shown to increase by 128-fold on mitogen activated T- 
cells relative to resting splenocytes. Interestingly, previous studies have shown 
levels of Dad1 (the defender against apoptotic cell death, a member of the OST 
complex) are modulated during T-cell development, to reach maximal expression 

15 in mature T-cells, and peripheral T-cells of Dadf-transgenic mice display 
hyperproliferation in response to stimuli. All these observations suggest that SIMP 
could be particularly important for cells with a high proliferation rate. 

Hi) T-cell immunotherapy targeted to MHC-associated peptides encoded by 
20 SIMP 

SIMP polypeptides may be useful for eliminating tumoral cells in human and 
more particularly hematopoietic cancer cells. This may be achieved by injecting 
into a cancer bearing host T-lymphocytes, that recognize complexes of SIMP- 
derived peptide/MHC on cancer cells. In a preferred embodiment, the 
25 SIMP-derived peptide comprises at least eight sequential amino acids of SEQ ID 
NO:2 (hSIMP). More preferably, the fragment is selected from the fragment listed 
in Table 1 . 
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Since ITM1 and SIMP are paralogs, the method could potentially be used 
by targeting ITM1-derived peptides/MHC complexes as well. Preferably, the ITM1- 
derived peptide will be selected from the peptides that comprise at least nine 
sequential amino acids of SEQ ID NO: 12 (hlTM1). 
5 Some of the methods of T-lymphocytes selection and methods of 

immunotherapy are described in detail in PCT application No. PCT/CA0 1/0 1477 
which is incorporated herein by reference. Four immunotherapeutic situations can 
be envisaged depending on the type of effector T-cells used and on the nature of 
the target SIMP-derived peptide. Indeed, T-cells can be i) allogeneic, that is, T- 
10 cells obtained from another individual or ii) self, that is, the patient's T-cells. The 
target SIMP peptide can be either polymorphic or non polymorphic. 

Situation 1: Allogeneic T-cells, non polymorphic peptide target. 

According to a preferred embodiment, T-cells that specifically recognize the 

15 target MHC/SIMP peptide epitope (alio MHC-restricted T-cells) will be generated 
from an MHC-incompatible donor. In vitro T-cell expansion will be carried out using 
current cell culture techniques following stimulation with the target epitope or a 
heteroclitic variant of the SIMP peptide (a variant of the peptide whose sequence 
has been modified to increase its immunogenicity). Heteroclitic peptides may be 

20 synthesized by replacing one (or a few) natural amino acids in a polypeptide by an 
amino acid that is predicted (using a tool such as BIMAS HLA peptide binding 
predictions) to bind with a superior affinity to a few MHC molecules. T-cells that 
react with the target epitope will be purified with the MHC/SIMP-peptide tetramers, 
cloned, and their innocuity for normal host cells will be assessed with in vitro 

25 assays ( 3 H-thymidine or 51 Cr release, cytokine production). The selected and 
expanded T-cell clones will be injected into the blood vessels of the recipient. 
Injected T lymphocytes will then "seek and destroy" neoplastic cells located in 
various tissues and organs. 

30 Situation 2: Allogeneic T-cells, polymorphic peptide target 

This embodiment is carried out as in Situation 1, except that the donor that 
is selected is MHC-identical with the recipient. MHC identity is assessed based on 
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currently available methods of MHC typing using antibodies and nucleotide 
probes. In this case, the T-cells are said to be self MHC-restricted and the target 
peptide is called an MiHA. 

5 Situation 3: Self T-cells transfected with an allogeneic TCR specific for a 
polymorphic or non polymorphic peptide target 

T-cell clones are generated as in Situations 1 and 2. However, rather than 
injecting allogeneic T-cells into the recipient, the T-cell receptor (TCR) of these 
allogeneic T-cells is cloned and used to transfect recipient T-cells in vitro 
10 (Stanislawski et al., 2001, Nat Immunol 2:962-970; Kessels et al., 2001, 
Natlmmunol 2:957-961). Transfected T-cells are then injected back into the 
recipient as described previously. 

Situation 4: Self T-cells not transfected with an allogeneic TCR and targeted to a 

15 polymorphic or non polymorphic target 

According to a preferred embodiment, T-cells from a cancer bearing patient 
are stimulated in vitro with antigen presenting cells expressing the target MHC- 
associated SIMP-peptide or a heteroclitic variant of the SIMP peptide (See 
situation 1). Expression of the target peptide can be either endogenous, or induced 

20 by RNA or cDNA transfection or pulsing with synthetic peptide using currently 
available methods. T-cells reacting with optimal avidity with cells expressing the 
target epitope are purified and expanded using currently available methods (Yee et 
ai, 1999, J.Immunol 162:2227-2234; Bullock et al., 2001, J. Immunol. 167:5824- 
5831) then injected into the recipients. 

25 

iv) SIMP Therapies 

Therapies may be designed to circumvent or overcome an inadequate 
SIMP gene expression. Indeed, SIMP seems to be expressed in higher levels in 
high proliferative cells. Therefore, SIMP protein or polypeptides may be effective 
30 proliferative agents and increasing their intracellular levels may help or stimulate 
cell proliferation. This could be accomplished for instance by transfection of SIMP 
cDNA. Thus, cancer treatment with radiotherapy and chemotherapy is currently 
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limited by the hematological toxicity of these treatment modalities, that is, the 
length of time required for proliferation of hematopoietic progenitors to restore 
normal levels of blood cells. Therefore, the following strategy could be used to 
shorten the length of blood cytopenias following chemo or radiotherapy: 
5 hematopoietic progenitors harvested from the blood or the bone marrow of a 
patient are transfected with SIMP cDNA and the transfected cells are then re- 
injected into the patient before a cycle of chemo/radiotherapy. 

To obtain large amounts of pure SIMP, cultured cell systems would be 
preferred. Delivery of the protein to the affected tissues can then be accomplished 
10 using appropriate packaging or administrating systems. Alternatively, it is 
conceivable that small molecule analogs could be used and administered to act as 
SIMP agonists and in this manner produce a desired physiological effect. Methods 
for finding such molecules are provided herein. 



15 v) Downrepulation of SIMP expression 

1) For cancer therapy 

We have previously shown that T-cells targeted to the B6 dom1 peptide 
(derived from mSIMP) were extremely effective in eradicating B6 dom1 -positive cells 
(see PCT/CA0 1/0 1477). A corollary is that cancer cells could not escape a T-cell 

20 attack by downregulating SIMP expression or by expressing SIMP mutants. Thus, 
consistent with a crucial role of STT3 homologs in cell proliferation, we propose 
that SIMP expression is essential for cancer cell proliferation. Accordingly, 
downmodulation of SIMP could be used to treat cancer. Therefore, the invention 
relates to methods for modulating tumoral cell survival or for eliminating a tumoral 

25 cell in a human by reducing cellular expression levels of a human SIMP 
polypeptide. In a preferred embodiment, this is achieved by delivering an 
antisense into the tumoral cells. This can be achieved by intravenous injection 
using currently available methods (e.g. Crooke et a/., (2000), Oncogene 19, 6651- 
6659; Stein et a/. (2001), J Clin. Invest 108, 641-644; and Tamm et a/., (2001), 

30 Lancet 358, 489-497. Theoretically, this approach could be used for all types of 
cancer and should be most useful for those that proliferate more rapidly, that is, 
the most malignant cancers (e.g. hematopoietic cancer, lung cancers, intestine 
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cancers, prostate cancer, testis cancer, breast cancer, melanomas, pancreatic 
cancer sarcomas, prostate cancer and hematologic cancers). 

2) For modulating immune responses 
5 As mentioned above, OST activity seems to be particularly important for 

T-lymphocytes function. Furthermore, the previous observation that the number of 
copies of B6 dom1 MiHA per cell (a peptide from mSIMP) was increased 128-fold on 
mitogen activated T-cells relative to resting splenocytes, suggests that SIMP is 
very important for T-cell activation/proliferation. Accordingly, downmodulation of 

10 SIMP expression could be used to dampen immune responses, particularly in the 
context of transplantation or autoimmune diseases. 

Therefore, the invention also relates to methods for modulating an immune 
response by reducing cellular expression levels of a SIMP polypeptide. In a 
preferred embodiment, the method is used for decreasing lymphoid cell 

15 proliferation, and it comprises the step of decreasing in these cells cellular 
expression levels of a SIMP polypeptide. Such a method may be particularly useful 
for dampening deleterious immune responses occurring in recipients of organ or 
tissue transplant and in people with autoimmune disease. We infer that inhibition 
of SIMP function could be useful to prevent or treat transplant rejection and to treat 

20 autoimmune diseases such as diabetes, multiple sclerosis, rheumatoid arthritis 
etc. Preferably, reduced SIMP cellular expression is obtained by delivering a SIMP 
antisense into lymphoid cells by intravenous injection. 

According to a related aspect of the two above-mentioned methods, the 
invention relates to antisense nucleic acids and to pharmaceutical compositions 

25 comprising such antisenses, the antisense being capable of reducing hSIMP 
cellular levels of expression. Preferably, the antisense nucleic acid is 
complementary to a nucleic acid sequence encoding a hSIMP protein or encoding 
any of the polypeptides derived therefrom and more particularly those listed in 
Table 1. More preferably, the antisense hybridizes under high stringency 

30 conditions to a genomic sequence or to a mRNA. Even more preferably, the 
antisense of the invention hybridizes under high stringency conditions to SEQ ID 
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NO: 1 (hSIMP) or to a complementary sequence thereof. A non limitative example 
of high stringency conditions includes: 

a) pre-hybridization and hybridization at 68°C in a solution of 5X SSPE (1X 
SSPE = 0.18 M NaCI, 10 mM NaH 2 P0 4 ); 5X Denhardt solution; 0.05% 

5 (w/v) sodium dodecyl sulfate (SDS); et 100 pg/ml salmon sperm DNA; 

b) two washings for 10 min at room temperature with 2X SSPE and 0.1% 
SDS; 

c) one washing at 60°C for 15 min with 1X SSPE and 0.1% SDS; and 

d) one washing at 60°C for 15 min with 0.1X SSPE et 0.1% SDS. 

10 

vi) Administration of SIMP Polypeptides. Modulators of SIMP Synthesis or 
Function 

A SIMP protein, polypeptide, or modulator (e.g. antisense) may be 
administered within a pharmaceutically acceptable diluent, carrier, or excipient, in 

15 unit dosage form. Conventional pharmaceutical practice may be used to provide 
suitable formulations or compositions to administer SIMP protein, polypeptide, or 
modulator to patients. Administration may begin before the patient is symptomatic. 
Any appropriate route of administration may be employed, for example, 
administration may be parenteral, intravenous, intraarterial, subcutaneous, 

20 intramuscular, intracranial, intraorbital, ophthalmic, intraventricular, intracapsular, 
intraspinal, intracisternal, intraperitoneal, intranasal, aerosol, by suppositories, or 
oral administration. Therapeutic formulations may be in the form of liquid solutions 
or suspensions; for oral administration, formulations may be in the form of tablets 
or capsules; and for intranasal formulations, in the form of powders, nasal drops, 

25 or aerosols. 

Methods well known in the art for making formulations are found, for 
example, in "Remington's Pharmaceutical Sciences." Formulations for parenteral 
administration may, for example, contain excipients, sterile water, or saline, 
polyalkylene glycols such as polyethylene glycol, oils of vegetable origin, or 
30 hydrogenated napthalenes. Biocompatible, biodegradable lactide polymer, 
lactide/glycolide copolymer, or polyoxyethylene-polyoxypropylene copolymers may 
be used to control the release of the compounds. Other potentially useful 
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parenteral delivery systems include ethylene-vinyl acetate copolymer particles, 
osmotic pumps, implantable infusion systems, and liposomes. Formulations for 
inhalation may contain excipients, for example, lactose, or may be aqueous 
solutions containing, for example, polyoxyethylene-9-lauryl ether, glycocholate and 
5 deoxycholate, or may be oily solutions for administration in the form of nasal 
drops, or as a gel. 

If desired, treatment with a SIMP protein, polypeptide, or modulatory 
compound may be combined with more traditional therapies for the disease such 
as surgery, steroid therapy, or chemotherapy for autoimmune disease; other 

10 immunosuppressive agents for transplant rejection; and radiotherapy, 
chemotherapy for cancer. 

According to a preferred embodiment, A SIMP antisense would be 
incorporated in a pharmaceutical composition comprising at least one of the 
oligonucleotides defined previously, and a pharmaceutical^ acceptable carrier. 

15 The amount of antisense present in the composition of the present invention is a 
therapeutically effective amount. A therapeutically effective amount of antisense is 
that amount necessary so that the antisense performs its biological function 
without causing overly negative effects in the host to which the composition is 
administered. The exact amount of oligonucleotides to be used and composition to 

20 be administered will vary according to factors such as the oligo biological activity, 
the type of condition being treated, the mode of administration, as well as the other 
ingredients in the composition. Typically, the composition will be composed of 
about 1% to about 90% of antisense, and about 20 pg to about 20 mg of antisense 
will be administered. For preparing and administering antisenses as well as 

25 pharmaceutical compositions comprising the same, methods well known in the art 
may be used. For instance, see Crooke etai (Oncogene, 2000,19:6651-6659) and 
Tamm et al. (Lancet 200,1358:489-497) for a review of antisense technology in 
cancer chemotherapy. 

30 vii) Upregulation of SIMP expression 

Upregulation of SIMP expression in T-lymphocytes could be used to 
increase T-lymphocyte proliferation following antigen encounter. Indeed, it is 
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suggested that upregulation of SIMP would increase the size of effector T-cell and 
memory T-cell pools, that is, the efficacy of T-cell responses and the duration of a 
biologically relevant (protective) T-cell memory. In other words, increased SIMP 
function would be used as an immune adjuvant. 
5 Therefore, the invention also relates to methods for modulating an immune 

response by increasing cellular expression levels of a SIMP polypeptide in 
lymphoid cells. In a preferred embodiment, such a method is used for increasing 
the level and/or the duration of an antigen-primed lymphocyte proliferation. 
Preferably, this is achieved by transfecting in vivo or ex vivo lymphocytes with a 
10 SIMP cDNA. Targeted lymphocytes can be CD4 T-cells and/or CD8 T-cells and/or 
B-cells. 

viii) Synthesis of SIMP and fragments thereof 

The characteristics of the cloned SIMP gene sequence may be analyzed by 

15 introducing the sequence into various cell types or using in vitro extracellular 
systems. The function of SIMP may then be examined under different 
physiological conditions. The SIMP DNA sequence may be manipulated in studies 
to understand the expression of the gene and gene product. Alternatively, cell lines 
may be produced which overexpress the gene product allowing purification of 

20 SIMP for biochemical characterization, large-scale production, antibody 
production, and patient therapy. 

For protein expression, eukaryotic and prokaryotic expression systems may 
be generated in which the SIMP gene sequence is introduced into a plasmid or 
other vector which is then introduced into living cells. Constructs in which the SIMP 

25 cDNA sequence containing the entire open reading frame inserted in the correct 
orientation into an expression plasmid may be used for protein expression. 
Alternatively, portions of the sequence, including wild-type or mutant SIMP 
sequences, may be inserted. Prokaryotic and eukaryotic expression systems allow 
various important functional domains of the protein to be recovered as fusion 

30 proteins and then used for binding, structural and functional studies and also for 
the generation of appropriate antibodies. 
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Eukaryotic expression systems permit appropriate post-translational 
modifications to expressed proteins. This allows for studies of the SIMP gene and 
gene product including determination of proper expression and post-translational 
modifications for biological activity, identifying regulatory elements located in the 5' 
5 region of the SIMP gene and their role in tissue regulation of protein expression. It 
also permits the production of large amounts of normal and mutant proteins for 
isolation and purification, to use cells expressing SIMP as a functional assay 
system for antibodies generated against the protein, to test the effectiveness of 
pharmacological agents or as a component of a signal transduction system, to 

10 study the function of the normal complete protein, specific portions of the protein, 
or of naturally occurring polymorphisms and artificially produced mutated proteins. 
The SIMP DNA sequence may be altered by using procedures such as restriction 
enzyme digestion, DNA polymerase fill-in, exonuclease deletion, terminal 
deoxynucleotide transferase extension, ligation of synthetic or cloned DNA 

15 sequences and site directed sequence alteration using specific oligonucleotides 
together with PCR. 

A SIMP polypeptide may be produced by a stably-transfected mammalian 
cell line. A number of vectors suitable for stable transfection of mammalian cells 
are available to the public, as are methods for constructing such cell lines. 

20 Once the recombinant protein is expressed, it is isolated by, for example, 

affinity chromatography. In one example, an anti-SIMP antibody, which may be 
produced by the methods described herein, can be attached to a column and used 
to isolate the SIMP protein. Lysis and fractionation of SIMP-harboring cells prior to 
affinity chromatography may be performed by standard methods. Once isolated, 

25 the recombinant protein can, if desired, be purified further. 

Methods and techniques for expressing recombinant proteins and foreign 
sequences in prokaryotes and eukaryotes are well known in the art and will not be 
described in more detail. One can refer, if necessary to Joseph Sambrook, David 
W. Russell, Joe Sambrook Molecular Cloning: A Laboratory Manual 2001 Cold 

30 Spring Harbor Laboratory Press. Those skilled in the art of molecular biology will 
understand that a wide variety of expression systems may be used to produce the 
recombinant protein. The precise host cell used is not critical to the invention. The 
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SIMP protein may be produced in a prokaryotic host (e.g., E. coli) or in a 
eukaryotic host (e.g., S. cerevisiae, insect cells such as Sf21 cells, or mammalian 
cells such as COS-1, NIH 3T3, or HeLa cells). These cells are publicly available, 
for example, from the American Type Culture Collection, Rockville, MD. The 
5 method of transduction and the choice of expression vehicle will depend on the 
host system selected. 

Polypeptides of the invention, particularly short SIMP fragments, may also 
be produced by chemical synthesis. These general techniques of polypeptide 
expression and purification can also be used to produce and isolate useful SIMP 

10 fragments or analogs, as described herein. 

The polypeptides of the present invention may also be incorporated in 
polypeptides of various length, preferably from about 8 to about 50 amino acids, 
an more preferably from about 8 to about 12 amino acids. According to a preferred 
embodiment, the peptides are incorporated in a tetrameric complex comprising a 

15 plurality of identical or different SIMP peptides/polypeptides according to the 
invention. According to another preferred embodiment, the peptides of the 
invention are incorporated into a support comprising at least two peptidic 
molecules. Examples of suitable supports include polymers, lipidic vesicles, 
microsphere, latex beads, polystyrene beads, proteins and the like. 

20 Skilled artisans will recognize that a mammalian SIMP, or a fragment 

thereof (as described herein), may serve as an active ingredient in a therapeutic 
composition. This composition, depending on the SIMP or fragment included, may 
be used to regulate cell proliferation, survival and apoptosis and thereby treat any 
condition that is caused by a disturbance in cell proliferation, accumulation or 

25 replacement. Thus, it will be understood that another aspect of the invention 
described herein, includes the compounds of the invention in a pharmaceutical^ 
acceptable carrier. 

ix) SIMP Antibodies 

30 The invention features a purified antibody (monoclonal and polyclonal) that 

specifically binds to a SIMP protein. 
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The antibodies of the invention may be prepared by a variety of methods 
using the SIMP proteins or polypeptides described above. For example, the SIMP 
polypeptide, or antigenic fragments thereof, may be administered to an animal in 
order to induce the production of polyclonal antibodies. Alternatively, antibodies 
5 used as described herein may be monoclonal antibodies, which are prepared 
using hybridoma technology (see, e.g., Hammerling ef a/., In Monoclonal 
Antibodies and T-Cell Hybridomas, Elsevier, NY, 1981). The invention features 
antibodies that specifically bind human or murine SIMP polypeptides, or fragments 
thereof. In particular, the invention features "neutralizing" antibodies. By 

10 "neutralizing" antibodies is meant antibodies that interfere with any of the biological 
activities of the SIMP polypeptide, particularly the ability of SIMP to inhibit 
apoptosis. The neutralizing antibody may reduce the ability of SIMP polypeptides 
to inhibit apoptosis by, preferably 50%, more preferably by 70%, and most 
preferably by 90% or more. Any standard assay of apoptosis, including those 

15 described herein, may be used to assess potentially neutralizing antibodies. Once 
produced, monoclonal and polyclonal antibodies are preferably tested for specific 
SIMP recognition by Western blot, immunoprecipitation analysis or any other 
suitable method. 

In addition to intact monoclonal and polyclonal anti-SIMP antibodies, the 
20 invention features various genetically engineered antibodies, humanized 
antibodies, and antibody fragments, including F(ab')2, Fab\ Fab, Fv and sFv 
fragments. Antibodies can be humanized by methods known in the art. Fully 
human antibodies, such as those expressed in transgenic animals, are also 
features of the invention. 
25 Antibodies that specifically recognize SIMP (or fragments of SIMP), such as 

those described herein, are considered useful to the invention. Such an antibody 
may be used in any standard immunodetection method for the detection, 
quantification, and purification of a SIMP polypeptide. Preferably, the antibody 
binds specifically to SIMP. The antibody may be a monoclonal or a polyclonal 
30 antibody and may be modified for diagnostic or for therapeutic purposes. The most 
preferable antibody binds the SIMP polypeptide sequences of SEQ. ID NO:1 
(hSIMP) and/or SEQ. ID NO:4 (mSIMP). 
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The antibodies of the invention may, for example, be used in an 
immunoassay to monitor SIMP expression levels, to determine the subcellular 
location of a SIMP or SIMP fragment produced by a mammal or to determine the 
amount of SIMP or fragment thereof in a biological sample. Antibodies that inhibit 
5 SIMP described herein may be especially useful for conditions where decreased 
SIMP function would be advantageous that is, inhibition of cancer cell proliferation, 
prevention of rejection and the treatment of autoimmune disease. In addition, the 
antibodies may be coupled to compounds for diagnostic and/or therapeutic uses 
such as radionucleotides for imaging and therapy and liposomes for the targeting 
10 of compounds to a specific tissue location. The antibodies may also be labeled 
(e.g. immunofluorescence) for easier detection. 

x) Assessment of SIMP intracellular or extracellular levels 

As noted, the antibodies described above may be used to monitor SIMP 

15 protein expression and/or to determine the amount of SIMP or fragment thereof in 
a biological sample. 

In addition, in situ hybridization may be used to detect the expression of the 
SIMP gene. As it is well known in the art, in situ hybridization relies upon the 
hybridization of a specifically labeled nucleic acid probe to the cellular RNA in 

20 individual cells or tissues. Therefore, oligonucleotides or cloned nucleotide (RNA 
or DNA) fragments corresponding to unique portions of the SIMP gene may be 
used to asses SIMP cellular levels or detect specific mRNA species. Such an 
assessment may also be done in vitro using well known methods (Northern 
analysis, quantitative PCR, etc.) 

25 Determination of the amount of SIMP or fragment thereof in a biological 

sample may be especially useful for diagnosing a cell proliferative disease or an 
increased likelihood of such a disease, particularly in a human subject, using a 
SIMP nucleic acid probe or SIMP antibody. Preferably the disease is a rapidly 
growing cancer or a cancer that displays a short doubling time (e.g. hematopoietic 

30 cancer, lung cancers, prostate cancer, testis cancer, breast cancer, melanomas, 
pancreatic cancer intestine cancers, sarcomas, prostate cancer and hematologic 
cancers). This may be achieved by contacting, in vitro or in vivo, a biological 
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sample (such as a blood sample or a tissue biopsy) from an individual suspected 
of harboring cancer cells, with a SIMP antibody or a probe according to the 
invention, in order to evaluate the amount of SIMP in the sample or the cells 
therein. The measured amount would be indicative of the probability of the subject 
5 of having proliferating tumoral cells since it is expected that these cells have a 
higher level of SIMP expression. 

In a related aspect, the invention features a method for detecting the 
expression of SIMP in tissues comprising, i) providing a tissue or cellular sample; 
ii) incubating said sample with an anti-SIMP polyclonal or monoclonal antibody; 

10 and iii) visualizing the distribution of SIMP. 

Assay kits for determining the amount of SIMP in a sample would also be 
useful and are within the scope of the present invention. Such a kit would 
preferably comprise SIMP antibody(ies) or probe(s) according to the invention and 
at least one element selected from the group consisting of instructions for using 

15 the kit, assay tubes, enzymes, reagents or reaction buffer(s), enzyme(s). 

xi) Identification of Molecules that Modulate SIMP Protein Expression 

SIMP cDNAs may be used to facilitate the identification of molecules that 
increase or decrease SIMP expression. In one approach, candidate molecules are 

20 added, in varying concentration, to the culture medium of cells expressing SIMP 
mRNA. SIMP expression is then measured, for example, by Northern blot analysis 
using a SIMP cDNA, or cDNA or RNA fragment, as a hybridization probe. The 
level of SIMP expression in the presence of the candidate molecule is compared to 
the level of SIMP expression in the absence of the candidate molecule, all other 

25 factors (e.g. cell type and culture conditions) being equal. 

Compounds that modulate the level of SIMP may be purified, or 
substantially purified, or may be one component of a mixture of compounds such 
as an extract or supernatant obtained from cells (Ausubel et al., supra). In an 
assay of a mixture of compounds, SIMP expression is tested against progressively 

30 smaller subsets of the compound pool (e.g., produced by standard purification 
techniques such as HPLC or FPLC) until a single compound or minimal number of 
effective compounds is demonstrated to modulate SIMP expression. 
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Compounds may also be screened for their ability to modulate SIMP- 
biological activity (e.g. enhancement of cell growth, inhibition of apoptosis, protein 
glycosylation, generation of MHC-associated SIMP-derived peptides). In this 
approach, the biological activity of SIMP or of a cell expressing SIMP (e.g. 
5 lymphocytes or a cancer cell) in the presence of a candidate compound is 
compared to the biological activity in its absence, under equivalent conditions. 
Again, the screen may begin with a pool of candidate compounds, from which one 
or more useful modulator compounds are isolated in a step-wise fashion. The 
SIMP or cell biological activity may be measured by any suitable standard assay. 

10 The effect of candidate molecules on SIMP-biological activity may, instead, 

be measured at the level of translation by using the general approach described 
above with standard protein detection techniques, such as Western blotting or 
immunoprecipitation with a SIMP-specific antibody (for example, the SIMP 
antibody described herein). 

15 Another method for detecting compounds that modulate the activity of 

SIMPs is to screen for compounds that interact physically with a given SIMP 
polypeptide. Depending on the nature of the compounds to be tested, the binding 
interaction may be measured using methods such as enzyme-linked 
immunosorbent assays (ELISA), filter binding assays, FRET assays, scintillation 

20 proximity assays, microscopic visualization, immunostaining of the cells, in situ 
hybridization, PCR, etc. 

A molecule that promotes an increase in SIMP expression or SIMP activity 
is considered particularly useful to the invention; such a molecule may be used, for 
example, as a therapeutic to increase cellular levels of SIMP and thereby exploit 

25 the ability of SIMP polypeptides to increase the efficacy and/or duration of a T-cell 
response. 

A molecule that decreases SIMP activity (e.g., by decreasing SIMP gene 
expression or polypeptide activity) may be used to decrease cellular proliferation. 
This would be advantageous in the treatment of cancer, particularly hematopoietic 
30 cancers, or other cell proliferative diseases. 

Molecules that are found, by the methods described above, to effectively 
modulate SIMP gene expression or polypeptide activity, may be tested further in 
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animal models. If they continue to function successfully in an in vivo setting, they 
may be used as therapeutics to either increase the efficacy and/or duration of a T- 
cell response, or to inhibit tumoral cell survival. 

5 xii) Construction of Transgenic Animal 

Previous studies have shown that the B6 dom1 (i.e. mSIMP-derived) MiHA 
displays several important specific features: i) it is highly immunogenic 
(immunodominant) for T-lymphocytes; ii) the number of MHC-associated B6 dom1 
copies per cell is higher than for any other endogenous MHC class l-associated 

10 peptides; iii) the expression of B6 dom1 (at the level of MHC-associated peptides) is 
dramatically increased (128-fold) on activated T-cells relative to resting 
splenocytes; and iv) B6 dom1 is an ideal target for adoptive immunotherapy of 
hematologic malignancies. 

Study of these important features at the molecular level was hampered by 

15 the fact that the identity of gene encoding this peptide as well as the exact peptide 
sequence of the B6 dom1 MiHA were unknown. Discovery that the B6 dom1 MiHA is 
encoded by the SIMP gene and that the exact sequence of the B6 dom1 MiHA is 
KAPDNRETL (see exemplification section) will allow for the generation of 1) 
transgenic mice that express the SIMP gene or SIMP mutants at various levels in 

20 one or multiple cell lineages, 2) knock-out mice in which expression of the 
endogenous SIMP gene is either prevented or regulated in one or multiple cell 
lineages. 

Characterization of SIMP genes provides information that is necessary for a 
SIMP knockout animal model to be developed by homologous recombination. 
25 Preferably, the model is a mammalian animal, most preferably a mouse. Similarly, 
an animal model of SIMP overproduction may be generated by integrating one or 
more SIMP sequences into the genome, according to standard transgenic 
techniques. 

Two types of transgenic mice could be generated initially: one expressing 
30 the SIMP gene ubiquitously, the other expressing SIMP selectively in T- 
lymphocytes. The site of expression could be determined according to the nature 
of the promoter gene to which the SIMP transgene will be coupled. Ubiquitous 
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expression of SIMP would allow to identify which tissues and organs are most 
sensitive to SIMP overexpression. Expression in T-cells would allow to assess to 
which extent overexpression of SIMP would affect the level and specificity of 
immune responses. Because a complete "standard knockout" would probably be 
5 not viable, it would be preferable to generate conditional knockouts where the 
SIMP gene expression would be inhibited at a precise time and only in selected 
tissue or organs using previously described methods (e.g. Labrecque et al., 
Immunity 15, 71-82; Polic et al.,, Proc. Natl. Acad. Sci. U. S. A 98, 8744-8749). 
Knockout and transgenic mice would provide the means, in vivo, to study SIMP 
10 cellular biology (glycosylation, antigen processing, cell proliferation) and/or to 
screen for therapeutic compounds. 



EXAMPLES 



15 The examples are meant to illustrate, not to limit the invention. 



EXAMPLE 1: Discovery of the mouse gene encoding the B6 dom1 MiHA 
Background 

B gdomi j s an j mmunoc | om j nan t ubiquitous mice MiHA (Fontaine et al., 
20 (2001). Nat Med. 7:789-794). Although the immunogenic properties of B6 dom1 have 
been characterized (Eden etai, (1999) J. Immunol. 162:4502-4510), the identity of 
the gene and the protein from which the B6 dom1 peptide was derived have 
remained unknown until now. 



25 Materials and methods 

Isolation of mouse tissue RNA 

For initial isolation of cDNA encoding the putative B6 dom1 peptide, total RNA 
was isolated from various tissues of C57BL/6J mice or from the congenic B10.H7 b 
mouse strain. Routinely, a piece of liver (100mg) was placed in 1ml of TRIZOL™, 
30 and was subsequently homogenized using a hand-held mini-Potter homogenizer. 
Samples were allowed to stand for 5 min. at room temperature to fully dissociate 
nucleoprotein complexes; 200jal of chloroform was added and mixed vigorously, 
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after which samples were again left at room temperature for 2 min, followed by 
centrifugation at 12,000g for 15 mins at 4 C. The aqueous (upper) phase was 
transferred to a clean tube, 500|al of isopropanol was added, samples were mixed 
and left at room temperature for 10 min, followed by centrifugation for 10 min as 
5 above. Pellets were washed in 1ml of 75% ethanol, centrifuged at 7,500g for 10 
min at 4°C, dried briefly in the air, and then resuspended in 200|il RNAse-free 
water. The OD 2 6o was used to determine the concentration of the RNA obtained, 
which was usually well in excess of 1|xg/|il when mouse liver was used. 

10 RT-PCR amplification of mouse SIMP cDNA 

Total RNA prepared from mouse tissues was used as a template for 
subsequent RT-PCR reactions. First strand cDNA synthesis was performed using 
standard protocols. Briefly, a poly d(T) oligo (20pmol) was used to prime a reverse 
transcription reaction using 1 jag of mouse RNA and 200U of Superscript reverse 

15 transcriptase, and the reaction was allowed to proceed for one hour at 42°C. This 
product was then used as a template for PCR-mediated amplification of a mouse 
SIMP fragment (~ 400bp) using oligonucleotides specific for the mouse gene. The 
oligonucleotides used were 5'-GAGAGTTCCGAGTAGAC-3 f (SEQ ID NO:315) 
(sense strand, corresponding to mouse SIMP nucleotides 2166-2182) and 5'- 

20 GCGTTCTCTCAAGGACTGCTG-3' (SEQ ID NO:316) (anti-sense strand, 
corresponding to SIMP nucleotides 2592-2572). PCR conditions were 94 °C for 3 
min, followed by 30 cycles consisting of 94 °C for 30s, 60 °C for 30s and 68 °C for 
3 min, with a final extension of 10 min at 68 °C. The enzyme used for PCR was 
Pfx polymerase (Gibco). 

25 Full length B6 and B10.H7 b SIMP cDNA was isolated in a similar fashion 

with the single exception that a SIMP 5 f end-specific oligonucleotide corresponding 
to nucleotides 41-59 was used with the 3* oligonucleotide outlined above 
(nucleotides 2592-2572) to amplify the 2469bp coding sequence. 

30 DNA sequencing 

Dideoxynucleotide DNA sequencing was performed using both manual and 
automated systems. For manual routine sequencing of small PCR products, we 
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used the Redivue ^P-ddNTP Terminator Cycle sequencing kit (Amerhsam 
Pharmacia Biotech), using the PCR-mediated protocol suggested by the 
manfacturer. For sequencing of full-length SIMP clones an automated dye 
terminator system was used and performed by the DNA sequencing facility at BRI. 
5 Oligonucleotides specific for mouse SIMP were chosen so as to allow reading of 
the entire sequence using five oligonucleotides. 



Cytotoxicity assays 

Cytotoxic activity was assessed in a standard 51 Cr release assay (Pion et 
10 al.,1997. EurJ. Immunol. 27:421-430). Target blast cells, prepared by culturing 

C3H.SW spleen cells (3 x 10 6 /ml) with 5 pg/ml of Concanavalin A (Con A; Sigma 
Chemical Co., St-Louis, MO) for 48 hours, were labeled with 100 pCi Na 2 51 Cr 
(Dupont Co., Wilmington, DE) for 90 minutes, sensitized with synthetic peptides for 
90 minutes, then mixed with C3H.SW anti-C57BL/6 effector cells at a 50:1 effector 

15 to target ratio. Cells were then incubated for 4 hours at 37°C in a humidified 
atmosphere of 5% CO2. Afterwards, supernatants were harvested and counted in 
a gamma counter. All tests were done in triplicate. Spontaneous release was 
below 15%. Results are expressed as a percentage of specific lysis calculated as 
follows: % specific lysis = 100 x (experimental release - spontaneous 

20 release)/(maximum release - spontaneous release). 



Results 

Identification of a candidate gene using bioinformatic tools 

Elution of peptides from B6 dom1 positive cells, HPLC separation and T-cell 

25 mediated lysis assay were previously used to identify fractions containing peptides 
corresponding to mouse B6 dom1 . These peptides were then subjected to Edman 
degradation for peptide sequencing, and the sequence AAPDNRETF was 
obtained as the best candidate for the immunodominant mouse B6 dom1 peptide, 
although preliminary searches in databanks revealed that no known mouse (or 

30 human) protein contained this nonameric sequence. While we were confident that 
this peptide was biochemically very similar to that encoded by the mouse B6 dom1 
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gene, we did not rule out the possibility that it was not 100% identical to the native 
peptide. 

Blasts of the mouse genome which were selected for candidates that were 
similar but not identical to the putative B6 dom1 peptide, revealed that one gene in 
5 particular was a strong candidate, potentially encoding B6 dom1 . This gene 
(Accession no. AK018758) does not have a formal name nor assigned biological 
role, but contains an open reading frame of 2469 bp and encodes a protein of 
some 823 amino acids. The candidate peptide from this protein has the sequence 
KAPDNRETL, differing only at positions 1 and 9 respectively from the original 

10 candidate. Since B6 dom1 is an H2D b -associated peptide of which positions 4, 6 and 
7 appear to be critical contact residues for T-cell recognition (Perreault et al., 
J.Clin. Invest. 98:622-628), KAPDNRETL was considered a very strong candidate 
given that these amino acids are conserved. It was also evident from databank 
analysis that this gene seems to be fairly ubiquituously expressed, which was 

15 consistent with data we had previously obtained for B6 dom1 in mouse tissues 17 . 
Given that this gene was by far the best candidate obtained (in terms of homology 
with the putative AAPDNRETF sequence), we decided to further investigate its 
potential role as the source of the immunodominant MiHA, B6 dom1 . 

20 Phenotype/genotype correlation: qenotyping of 8 strains of mice (4 positive for 
B6 dom1 . 4 negative) 

A fundamental requirement for identification of the candidate gene as the 
one encoding B6 dom1 was that there had to be relevant differences in the coding 
sequences between B6 dom1+ and B6 dom1 " strains of mice; more specifically, for an 

25 ideal candidate there had to be sequence divergence in or adjacent to the 27bp 
region encoding KAPDNRETL, the putative B6 dom1 nonamer. 

Initially, we therefore decided to compare the sequence of this region of the 
candidate gene between the B6 parental strain (positive) and the B10.H7 b 
congenic strain (negative). Using mouse tissue cDNA and oligonucleotides specific 

30 for the candidate gene (designed based on the DNA sequence obtained from 
Genebank™), we amplified a region consisting of roughly the last 400bp of the* 
candidate gene, which encodes a sequence containing the nine amino acid 
candidate peptide. The results from this analysis were of great importance 
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because we found that the B10.H7 b mice contained only two single nucleotide 
mutations in this 400bp fragment: one which did not alter the amino acid 
sequence, and another (GAG to GAT) within the 27bp region outlined above, 
which changed the sequence of the B6 dom1 candidate peptide from KAPDNRETL 
5 (SEQ ID NO:313) to KAPDNRDTL (SEQ ID NO:314). This was very strong 
evidence that the candidate gene indeed coded for B6 dom1 , not least because this 
amino acid change was found at position 7 in the peptide, and this position is very 
important for contact with the TCR 15 . This result made it critical to examine other 
mouse strains to see whether the E to D mutation was a characteristic of the other 
10 B6 dom1 -negative strains, which would further support the contention that 
KAPDNRETL was indeed the native B6 dom1 sequence, encoded by our candidate 
gene. 

The B6, B10, LP, and 129 strains are all positive for B6 dom1 , while the A. BY, 
B10.H7 b , C3H.SW, and BALB.B strains are negative 16 . Summarized in Table 4 

15 below are the results of the sequence analysis of the candidate peptide as 
encoded by the cDNA from the various strains. Of note, the fact that a mouse 
strain is said to be B6 dom1 -negative, does not mean that the AK018758 gene (SEQ 
ID NO:319) is not expressed but rather that the sequence of its AK018758 gene is 
different from that of B6 dom1 -positive mice (it does not code for the exact 

20 nonapeptide sequence recognized by B6 dom1 -specific T-cells but rather codes for 
an allelic product). 

Table 4. Genotype/phenotype comparisons 

25 



30 



Strain 


ggDOMl 


Sequence 


B6 


+ 


KAPDNRETL 


B10 


+ 


KAPDNRETL 


LP 


+ 


KAPDNRETL 


129 


+ 


KAPDNRETL 


A. BY 




KAPDNRDTL 


B10.H7 b 




KAPDNRDTL 


BALB.B 




KAPDNRDTL 


C3H.SW 




KAPDNRDTL 
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These data were totally supportive of the hypothesis that the AK018758 
gene was indeed the gene encoding the B6 dom1 MiHA because (a) in each case 
only one mutation encoding an amino acid substitution was observed between 
strains in the 400bp region amplified by PCR, and (b) this mutation was identical in 
5 nature and position in each B6 dom1 -negative strain i.e. GAG to GAT (E to D). In ail 
cases B6 dom1 positive strains were identical to the parental B6 strain. Collectively 
these data are consistent with the hypothesis that we have identified (and 
subsequently cloned) the gene encoding mouse B6 dom1 . At this point we decided to 
compare the biological activity of the wild-type and mutant peptides to determine 
10 whether the peptides KAPDNRETL and KAPDNRDTL were targets for B6 dom1 - 
specific T-cell receptor-mediated recognition and cell lysis. 

Recognition of the KAPDNRETL (SEQ ID NO:313) and KAPDNRDTL (SEQ ID 
NO:314) peptides by B6 dom1 -specific CTLs 

15 In order to prove that the KAPDNRETL peptide was the epitope recognised 

by B6 dom1 -specific T-cells, we tested whether anti-B6 dom1 T-cells (from C3H.SW 
mice immunised with B6 cells) would kill C3H.SW target cells coated with each of 
the following synthetic peptides: AAPDNRETF (SEQ ID NO:312) (previously 
shown to be similar to the B6 dom1 peptide because it was recognised by B6 dom1 - 

20 specific T-cells), KAPDNRETL (SEQ ID NO:313) (the peptide now presumed to be 
the natural B6 dom1 epitope expressed in B6 dom1+ mice) and KAPDNRDTL (SEQ ID 
NO:314) (the product of the putative B6 dom1 allele found in B6 dom1 - strains of mice). 
Strikingly, the KAPDNRETL (SEQ ID NO:313) peptide was recognised more 
efficiently than the AAPDNRETF (SEQ ID NO:312) peptide at a 10 -8 M 

25 concentration while the KAPDNRDTL peptide (SEQ ID NO:314) was not 
recognised even at a 10" 5 M concentration (Figure 1). Altogether, these results 
show that KAPDNRETL (SEQ ID NO:313) represents the real natural peptide 
recognised by B6 dom1 -specific T-cells, that it is encoded by the AK018758 gene, 
and that following a single nucleotide substitution the sequence found in B6 dom1 ~ 

30 mice, corresponds to KAPDNRDTL (SEQ ID NO:314). Since i)AK018758 
encodes B6 dom1 and ii) we found that a human homolog comprises numerous 
peptide sequences that possess a high affinity binding motif for HLA class I 
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molecules (see example 2), the gene encoding mouse B6 dom1 was renamed 
mouse "SIMP", that is a Source of immunodominant MHC-associated Peptides. 

EXAMPLE 2: Discovery of the human SIMP 
5 Background 

Given that the SIMP protein and peptides derived therefrom seemed to 
represent an ideal target for adoptive cancer immunotherapy, we proceeded to the 
identification of the human homolog of SIMP. 



10 Materials and methods 

Isolation of full length human SIMP by RT-PCR 

Human SIMP cDNA was isolated by RT-PCR using human total cDNA as 
template (generated in an identical fashion to mouse cDNA, as described above). 
The oligonucleotides used for PCR were 5-GCGGAGGACGA GCGAGACC-3' 
15 (SEQ ID NO:317) (sense) and 5'-CGGTTCTCACAAGGACAACTGC-3' (SEQ ID 
NO:318) (anti-sense) to amplify the 2478bp coding sequence (826 amino acids). 
PCR products were obtained from cDNAs isolated from several donors and 
individually sequenced to confirm the human SIMP gene sequence. 



20 Results 

Although the human genome has been sequenced, a full length human 
equivalent of mouse SIMP has not been identified or cloned. Blasts of the human 
genome nevertheless suggested that there was a human SIMP homolog. One 
sequence is referred to as "(moderately) similar to oligosaccharyltransferase STT3 

25 subunit", and corresponds to the last 286 amino acids of mouse SIMP (Accession 
no AK027789). Also, GenomeScan™ analysis (a new feature available in the 
human genome databank) of the human genome indicates that AK027789 (SEQ 
ID NO:320) is located on chromosome 3. Thus, the existence of a human SIMP 
homolog is suggested by i) the existence of a human sequence whose putative 

30 protein products would be similar to the C-terminal part of the mouse SIMP protein 
and ii) the fact that this sequence was mapped to human chromosome 3, a region 
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that corresponds to the telomeric end of mouse chromosome 9 (the region 
encoding the B6 dom1 MiHA, and thus, where the mouse SIMP gene is located). 

Based upon available DNA sequence, we designed an oligo specific for the 
3' end of the human sequence and used this with an oligo that was specific for the 
5 5' end of the mouse sequence in RT-PCR experiments using human RNA. We 
were successful in amplifying a roughly 2,500bp fragment containing the entire 
coding sequence of human SIMP: this sequence is identified in the sequence 
listing section as SEQ ID NO:1 and the protein product encoded by this gene is 
identified as SEQ ID NO:2. The initiating Met codon (ATG) and termination stop 
10 codons (TAA) are shown, at the beginning and the end of the sequence 
respectively. 



Discussion 

We have previously shown that adoptive T-cell immunotherapy targeted to 

15 B6 dom1 , a peptide encoded by the mouse SIMP gene, could eradicate cancer cells 
without causing GVHD. Based on the work reported herein, we have identified the 
mouse B6 dom1 gene (mSIMP), cloned its human homolog (hSIMP), and discovered 
that the product of the human gene contains peptide sequences with a high affinity 
binding motif for HLA molecules. Interestingly, the yeast analog of the mouse and 

20 human SIMP gene, STT3, is essential for cell proliferation. We intend to evaluate 
whether expression of human SIMP gene is required for cancer cell proliferation. 
The logical assumption that this is also the case for cancer cells (that is, they need 
to express the SIMP gene to proliferate) has important mechanistic implications 
because this provides a sound basis for the remarkable efficacy of SIMP-targeted 

25 immunotherapy. Accordingly, cancer cells cannot downregulate expression of this 
gene to evade T-cells targeted to products of the SIMP gene because SIMP 
expression is essential for their proliferation. 

Having identified SIMP-encoded peptides with a high affinity binding motif 
for HLA molecules, we propose to use these peptides as targets for cancer 

30 immunotherapy. Selection of the most appropriate peptides will be based on two 
parameters: i) the level of expression of these peptides on various types of cancer 
cells (breast, prostate, lung, kidney, skin, lympho-hematopoietic tissues etc); ii) 
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whether these peptides are polymorphic or not. Polymorphic peptides (MiHAs) will 
be targeted with T-cells expressing self-MHC-restricted TCR whereas non 
polymorphic peptides will be targeted with T-cells expressing allo-MHC TCR. 
Targeting can be achieved by injection of alloreactive donor T-cells or by injection 
5 of recipient T-cells transfected with the genes encoding an alloreactive TCR 
(derived from a human or an animal donor). 

While several embodiments of the invention have been described, it will be 
understood that the present invention is capable of further modifications, and this 

10 application is intended to cover any variations, uses, or adaptations of the 
invention, following in general the principles of the invention and including such 
departures from the present disclosure as to come within knowledge or customary 
practice in the art to which the invention pertains, and as may be applied to the 
essential features hereinbefore set forth and falling within the scope of the 

15 invention or the limits of the appended claims. 
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